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I, Brian W. Ward, declare as follows; 

I am a citizen of the United States of America, and I reside in St. Louis County, Missouri. 

!r,r i ^ tIy C 7 1 ? ycd . at Sigma-Aldrich Co. as a research scientist. I have a substantial 
amount of formal education, training and experience in biochemical and biological 
research, and particularly, in the field of molecular biology, as evidenced by my 
curriculum vitae, attached hereto as Exhibit B. 

lam a co-inyentor of the abovcrefcrenced patent application, and therefore, am familiar 
with the contents and substance of we specification and presenu>pending claims. 

The present invention is directed toward novel compositions comprising a polymerase for 
an ex-vtvo enzymatic reaction in which a nucleic acid polymer product complementary to 
a nucleic acid polymer template is prepared, and a tracer dye compatible with the 
polymerase, the composition being substantially free of the nucleic acid polymer 
template. Particular embodiments of the invention comprise various composition 

SXn Va T S dCnS i ticS * Vari0US dyes ' inc,udin * for cxam Pl°, violet 5. 
T^fymera C ° mbmatl0DS 8nd various Phrases, including, for example, 
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h order to exemplify the invention, one particular embodiment of the invention is 
demonstrated m Example 1, specification p. 18, line 6 et seq. This particular embodiment 
demonstrates the use of Taq polymerase as the polymerase and a combination of 80% 
acid red 1 and 20% acid violet 5 as the tracer dye. 

Taq polymerase is a nucleotidyl transferase - I.e., an enzyme of the transferase group that 
catalyzes the transfer of a nucleotidyl group, such as for example, DNA polymerase or 
RNA polymerase. Such enzymes are classified based upon their template (the strand of 
DNA or RNA that specifies the nucleotide sequence of the newly synthesized product 
strand) anoVreaction products (the resulting DNA or RNA strand that is newly 
synthesized). Of particular importance with respect to the present application are the 
following two classes of polymerases: 

a. DNA-dependcnt DNA polymerases: employed in all cells for replication and 
repair of the DNA 

b. RNA-dependent DNA polymerases: employed by retro viruses in order to copy 
RNA into DNA (these polymerases commonly called reverse transcriptases 
(RT)). 

As of the filing date of the present application, July 6, 2000, it was common knowledge, 
as demonstrated by this Declaration and the references cited herein, that DNA 
polymerases share commonly conserved structural and functional characteristics 

S^Sr; ZT of ordinai y^ kil ! in wc a «> *e disclosure and use of Taq polymerase in 
Example 1 of the present application would be sufficiently descriptive to confer to me 
and others that the Applicants of the present invention were in possession not only of the 
particular embodiment disclosed above, but of alt such embodiments comprising a DNA 
polymerase, and more particularly a thermostable polymerase, and a tracer dye, and more 
particularly an anionic dye. Specifically, because of the high knowledge and skill in the 
art regarding the structure, function, and use of polymerases, and in particular 
thermostable polymerases, one would be well aware of the feet that the Applicants were 
in possession of all of the embodiments of the invention as claimed at the time of filine 
the present application. B 

Most DNA polymerases possess various structural and functional characteristics in 
ITZT' £ cha *ac*ristics are highly conserved, even among completely unrelated 
polymerases. Theprcsence of these common characteristics creates a rather unvarying 
overall structure among DNA polymerases. Furthermore, they impart upon the 
polymerases a substantially uniform mechanism of action. 

r%?£in y ' th -A^ ^ S . tmCtUre ° f a DNA P 01 *™*** (the Klenow fragment of E. 
coli Pol I) provided priceless insights into the structure of polymerases, and particularly, 
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" ? A Polymcrase Struch ^ «»d Mechanisms of Action", found at 

Kcrib^ 3,1(1 attached ^ » Exhibit C 

«scmblSt ? T?' ffT 1 ^ of thc Kleoow foment is often said to 
ZhZ™* 5^ PW "R?? 4 , 11181 iS ' me stracture P° sscss a W subdomain (a 
Thxs structure is demonstrated in Figure 1 of the Hingorani and O'Donnell reference and 

fLt^c ^ Z EXhiWtD ' ™ S P^^ 8 ' sh *P c is ^ far the mos^ncm 

feature common to all known polymerase structures. Specifically, several otheT 

^StV s ^i dCte ^ ed SUtSCqUCnt t0 ** nmow ^nthavc demonstrated 
frl KntT ? ffcrcnccs » 8011110 «W sequences and structures among enzymes 
^ different iamihes several features essential for polymerase activiry, income 
particular S hapc outhned above, are highly conserved. Examples of thii ^ncludTKe 

ZT" ° f ^ P01 1 family ° f P 0 ^^ 00«m fragment KerU 

aquaticus pol, 77 DNA pol, and Bacillus stearothermophiL pol); members of foelol II 
family of polymerases (RB69 pol, D. tok Pol); members of *c farmlyTo^ 

w«hin the fam.ly of thermostable DNA polymerases, including for example Taa 
(Thermus aouaticus), Klentaq (Thermus aauaticus. N-terminaf dcLSuS Vent*1i 
/! em ?»T Dee P Vent Cr>w»«w species GS-D), Pfi, (SoSZ 
jurtosus). Repute (rA«™ M ^), KOD (Pyrococcus koda^aensT^^ 
(Thermus thermophilic HB-8), and Tfl (Thermus flavus). 

S^ETiS? m, T /P^ fi W «* synergistically to hold the primer 

27^ ™ J* WCl l M W P ° Siti0n * e 1°"**** dNTP for incorporation into me 
cheS g A ? NA , St ^ d n % Pa,m SUM0niail1 Contalns thc cata1 ^ wheS, me 
£3^™??"^ f^ fer ^ ^ fmgers ^domain Leracts with £ 

subdomam is pnmanly responsible for binding the duplex DNA in a sequence- 
independent manner along the minor groove of the DNA. See, Hingorani and O'Donnell. 

£S^ te "\" E " S M ° f W6bsitBS discussiQ 8 ^cture. 

!£u^j£Z?- *TV° n0h °? tCS , StatemeQtS COntaincd 01 de ^tion, as 
SIT m Hmg0rani 30(1 McDonnell reference and the additional 

references cited below mpangraph 18. Furthermore, the statements eon** Unte 

^ McDOm,e " rcfcrCnCC Cited above ^ ^Uy been JJSi*r 

W R mZT m f * A - S ?^ T A - (1 " 8) ^ *^&1SM3 and 
aousa, R. ( (1996) Trout? Bwchem Sri, 21:186-90. 

oo S lvltL^r , A the r Cti0n °if hanism *M«9ed by polymerases, including Taq 
polymerase and other thermostable polymerases, is universal among the polymerases 
Specifically, all polymerases follow the mechanism for T7 dZ^Z^Z S 
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aXte^orS.^ 005 " 1 ^ ^ ^ 2 ° f Hing0rani «* °' Donne » " d 



The structural basis of this catalysis (U, reaction mechanism) relies upon a two metal 
c^mecharusm Figure 3 of Hingorani and O'Donncll, attached hereTa^S G 

i STw^fZ 1 *' inC,Udin8 iUuStrations of theT7 DNA polymerase Sve ^e 
(A. of Exhibit G) and the two-metal ion mechanism for nucleotidyl I transfer (B. of Exhibit 

£2. n~ V Stat 1 l n " iDg0rani McDomdl (and as supported by Alba, Jagcr and 
Plata, Brautigam and Stat* and Sousa (citations listed in paraph 1 8 below)), toe 

5 Se TV T 1C °!! dyl ^ reaction wherein * e ^ Ai group 
f**?^,*^^ ThisoccuxsinL 
iS L I' SSI r m I hC Pa,m ^ ° f ^ P°^ crasc - a ^ve site, as 

tea LTvsSrcZ;J? CS A^ ""T' usua ^ consisting of Mg* + ions, are essential for 

, ^ tXOn ^ y ' tW0 a * artato ™ d ** are absolutely conserved 
between the polymerase families. These residues provide the carboxylate oxygen 

found near the: '-hydroxyl group of the DNA primer and the alpha-phosphate of he 
mcoming dNTP In this position, ion A is situated to lower toe pKa of toe hyfroxyl 
group and to facilitate formation of the alkoxide anion, which can the^by Ste 

5?j-Tl ^ f s P edfic %. « A acts to stabile the transition state 

(toe state wherem an alpha phosphorous is penta-coordinatcd to five oxygen molecule? 

bvyramidal structure). 5ee afro, Slickers, Peter, "DNA Polymerases," 
http://www.imb-jena.de/ ^lickeis/porym(«ses/dna-polymerases.html 

MSJSrT 1 * Hin80rani McDmwl] > ™^ B (as designated in B. of 
Exhibit G) ligates oxygen groups m all three of the phosphate groups of the dNTP It is 

EES** 5S ^ in ** dtoua - t ° f * e tri P ho ^ c ***** 2SS b^toe 3 ! 
" f— 10 StabiHzil,g *° Charge on ±e transiti0Q *> d aiding m the 

active site and possibly ion B as well, aid in the stabilization of toe charged 

SSSSES? 38 * dissociates *° m poIymerase completion of the 

wl.?^ mc f hanism described ab °ve, as well as the geometry of the active site are 
highly conserved among the polymerases. This conservation occL even amongtoc 
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SeSe UnrelatCdPOlymeraSeS SUCb M 17 DNA P 0 *"""* Md Pol P 
to addition to the references cited herein, the above statements regarding DNA 

h ' ST C . A ' "HSf* V- ,,StraCtUrB, ^ fUnctional provided by 

c. Jjngoranil, M.M. and O'Donnell, M. "DNA Polymerase Structure and 



e. 
f. 



Sousa R. •'Structural and mechanistic relationships between nucleic acid 
polymerases." Trends Bhchem Sci (1 996) 21,1 86-90. 

* kB f wk *° ^ d ? kiU in *• with respect to DNA polymerases, as can be 

^ Vari0US l"****"-. Furthermore, those of 
to m ^ ^ *" ****** a"* the skill sufficient to enable Sem 

T? 865 ', 88 suchk f owled S« «*» skill are taught to students in ubS^ST 
molecular biology glasses and are practiced by undergraduate students in their EbtSuL 

^t£S?isr t0 ose of ^ modcrate skin * ** a ^ 

were in possession of not only a composition comprising Taq polymerase as the 
p P Se™^^ 

rtr^K ^ ^ ? ^J" 1 ^ embodiments of the present invention arc 

nUme T can be TOcd simi1 *' «««». Particularly 

oSld L *f 'cT'T ^ W f d * 801431,16 foruse in P~* invention^ 
outlined in the specification, and particularly, in Example 1 . 

wLffT 1 t^ plC l : ani ° nic dycs wcre sdected for SCfee "»g because they would 

ES^S^Jt nUCl f C 8dds **** the Polymerization reaction^ Tnis 
88 * rKult ° f 48 char 8~hargc repulsion. SpecificaUy. since the nucleic acid products 
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of PCR are highly anionic, they are therefore applied to the agarose gel near the cathode 
and move toward the anode as the electrophoresis progresses. Therefore, to be of use as a 
tracer in electrophoresis, the dye is also preferably anionic as well. In contrast, organic 
cations will almost certainly interact with polyanions such as polynucleic acids. Neutral 
dyes are also of little use due to their limited aqueous solubility. 

Furthermore, about one-half of the anionic dyes were thereafter eliminated from 
consideration by the mere fact that they were not of a red color. Specifically, Applicants 
demonstrated in Example 1 only dyes that were red. This color was selected for no 
reason other than aesthetics. The assignee of the present application, Sigma-Aldrich Co , 
typically selects products displaying red or a particular shade of red, as the color red is 
often associated with the Sigma-Aldrich Co. and many of the marks it has registered. 

The use of a red dye conferred no particular advantage on the embodiment of Example 1 
and there is no reason why dyes of other colors would not be described by and 
encompassed within the present invention. 

The dyes were further eliminated from consideration on the basis of characteristic such as 
ethanol precipitation, solid phase extraction, PCR toxicity, and ligation/transformation 
toxicity. These characteristics were chosen based upon the particularities of Example 1 
However, based upon the structural and functional characteristics of Taq polymerase (the 
polymerase used in Example 1) as well as knowledge and skill regarding anionic dyes it 
is clear that numerous other anionic dyes would also work well. Included within this 
group arc the dyes initially eliminated solely because they were not red in color. More 
specifically, other dyes would work well with DNA polymerases. Artisans practicing the 
invention would be able to determine, based upon the description and the general 
knowledge within the art, which dyes would be ideal for the particular embodiment being 
practiced without undue experimentation. 

In terms of the generalized polymerase mechanism shown above (Exhibit F) dyes of the 
present invention would not inhibit polymerization by reducing the concentration of DNA 
through the creation of a DNA-dyc complex. Additionally, as demonstrated by the 
present application, dyes of the present invention do not inhibit polymerization by other 
mechanisms, such as mechanisms in which the dye binds to the enzyme. Specifically as 
demonstrated in the present application, screening a large number of dyes showed that a 
subset did not adversely bind to Taq DNA polymerase in such a manner as to inhibit 
polymerization. Therefore, instructive of the present application is the fact that dyes of 
the present invention do not interact with the polymerase by binding to its structural 
leatures. 



This mode of inhibition was anticipated and/or found to be active or passive in nature In 
terms of active inhibition, the dye could competitively bind at one of the catalytic 
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27. 



28. 



subdomains thereby inhibiting the binding of key reaction components. It is conceivable 
that an anionic dye could competitively bind the enzyme at one or more of the template 
the primer, or the nucleotide binding surfaces. As demonstrated in the present 
application, specification p. 28, line 13 through p. 29, line 35, it was found that dyes 
passivery inhibit the enzyme by coordination of the required metal ion. This was, 
therefore, compensated for by making the dye metal the metal required by the 
polymerase, followed by optimization of free metal concentration. 

It was anticipated that the same process would be followed for other polymerase/dye 
combinations Specifically, it was believed that the successful use of anionic dyes as 
described in the present application would not only enable another to use the specific dyes 
disclosed, but also would enable one to use other anionic dyes that possess similar 
properties, as the dyes of the specific embodiments are indicative of the characteristics of 
other dyes that could be used with equally satisfactory results. Likewise, it is asserted 
that the structural and functional characteristics of the dyes of Example 1, as described 
therein, would clearly convey to a person of skill in the art that Applicants were in 
possession of the broader invention as claimed - i. e ., wherein the dye tracer component of 
a composition of the invention comprised an anionic dye that does not interfere with 
polymerase activity. The description of the dyes used in the particular embodiments of 
the present invention would indicate to one of skill in the art that Applicants were in 
possession of the invention as claimed at the time of filing. 

I further declare that all statements made herein of my own knowledge are true and that 
all statements made on information and belief arc believed to be true; and further that 
these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under 1 8 U.S.C. 1001, and that 
such w, lful false statements may jeopardize the validity of the application or any patent 
issuing thereon. / s~\ r 

Date^ 1 ' 
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EXHIBIT B 



BRIAN WARD, Ph.D. 

1640 Jonquil Dr. 
Webster Groves, MO 63119 
bwardhome@rocketmail.com 
(314) 962-0295 (H) 
(314) 771-5765 x3657 (W) 

EMPLOYMENT HISTORY 

9/88-present. Sigma_Aldrich Inc., St. Louis, MO. Currently technical lead and manager for Gene 
Expression and Analysis, Sequencing and Labeling, Hybridization and Detection. Developed REDTaq 
DNA polymerase for which a provisional patent was filed (see Patents section). Was in part responsible for 
the conception and development of the Exoclone technology. This technology obviates the need for 
restriction digestion of PCR products to yield sticky ended sequences that would be used in a subsequent 
ligation reaction. Initiated the development of other novel ligation technologies. 

Initiated a product line aimed at serving the nucleic acid sequence recognition research community. Most 
notable accomplishment is the invention of Type IIS Restriction Enzyme Footprinting (see research synopsis 
A for additional details). Established collaborative/cooperative relationships with internationally renowned 
scientists resulting in commercialization of the chemical footprinting reagent methiffiumpropylEDTA and the 
triplex specific intercalator BePI. Some of these relationships are also yielding publications. 

Was one of two scientists involved in the cloning and expression of commercially significant proteins in E. 
Coli (see research synopsis B). 

Developed and maintained oligonucleotide, DNA sequencing and electrophoresis products. New products 
included: stop linkers (oligonucleotide cassettes used to introduce stop codons into all reading frames of an 
expression vector), Maxam and Gilbert sequencing kit and electrophoresis convenience products. 

1/87-9/88. Biochemical Development, Sigma Chemical Company, St. Louis, MO. 

Developed a novel synthesis for meso-isothiocyanato octaethylporphyrin palladium (II), a molecule that 
could potentially be exploited as a phosphorescent label. Scaled up the synthesis of 3-azido- 
3'deoxythymidine (AZT). Produced/synthesized an assortment of department specific products. 

9/84-12/86. Department of Chemistry, Syracuse University, Syracuse, NY. 

Developed footprinting protocols and techniques to measure drug binding constants to DNA's from natural 
sources (see research synopsis C). Collaborated with Bristol-Myers scientists on a number of projects. 
Wrote a Bristol-Myers funded research grant to synthesize a potential sequence specific DNA cleaving 
molecule. 

9/78-8/84. Department of Chemistry, Michigan State University, East Lansing, ML 

Designed, synthesized and performed physical chemical experiments on heme model systems (see research 
synopsis D). Taught undergraduate laboratory and recitation courses. Merit Level Teaching Award was 
received for outstanding performance. 

4/83-10/83 Department of Engineering, Michigan State University, East Lansing, MI. 

Acted as a consultant to determine the practicality of using photochroic dyes to study fluid flow dynamics. 

The known, reversibly photochroic dyes owed their photochromogenic property to a 

photo induced zwitterionic charge transfer complex. Since the colored zwitterion form is a dominant species 

in polar solvents, non-polar (expensive and likely flammable) solvents would have been required for the 




Brian Ward, contd. 

studies. Due to the large volumes of solvent envisioned, it was decided that the proposed experiments 
not practical. 



ADMINISTRATIVE RESPONSIBILITIES 

Manage Gene Expression and Analysis, Sequencing and Labeling, Hybridization and Detection Groups. 
Instigated and organized departmental/company seminars. 
Interviewed applicants for R&D positions. 
Administered grant funds while principle investigator was on sabbatical. 

EDUCATION 

Post-doctoral, Biophysical Chemistry, Syracuse University, Syracuse, NY (1986) 
PhJX, Organic Chemistry, Michigan State University, East Lansing, MI (1984) 

B.S., Chemistry (major), Biology (minor), Mathematics (minor), Western Michigan University, Kalamazoo 
Michigan (1977) 

HONORS AND AWARDS 

Senior Chemist Dow Fellowship Amway Fellowship Merit Level Teaching Award 

PROFESSIONAL MEMBERSHIPS 

American Association for the Advancement of Science 
American Chemical Society 
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Summary 

Replicative DNA polymerases are essential for the replication of the genomes of all living organisms. 
On the basis of sequence similarities they can be classified into three types. Type A polymerases are 
homologous to bacterial polymerases I, Type B comprises archaebacterial DNA polymerases and 
eukaryotic DNA polymerase a, and the bacterial polymerase III class make up type C. Structures have 
been solved for several type A and B polymerases, which share a similar architecture. The structure of 
type C is not yet known. The catalytic mechanism of all three types involves two metal-ion-binding 
acidic residues in the active site. Replicative polymerases are constitutively expressed, but their activity 
is regulated through the cell cycle and in response to different growth conditions. 



Gene organization and evolutionary history 
Classification 

On the basis of sequence similarities, DNA polymerases can 
fall into three groups: type A, type B and type C, which have 
homology to polA (pol I), polB (pol II) and polC (pol III) from 
Escherichia coli, respectively [1,2]. Type C polymerases are 
not known to share structural similarity with types A and B, 
so they will not be covered in much detail in this review. In 
addition to replicative DNA polymerases, these groups also 
include polymerases involved in other types of DNA synthesis 
and in DNA repair. For example in bacteria the main replica- 
tive DNA polymerase is pol III (type C), while DNA poly- 
merase I (type A) is not essential for replication and DNA 
polymerase II (type B) is only present in a few bacteria. Pol I 
has a role in nucleotide excision repair and in the processing 
of Okazaki fragments that are generated on the lagging strand 
during DNA replication, while pol II is known to be involved 
in repair of DNA cross-links. Polymerase delta from eukary- 
otes belongs to the same polymerase group as the replicative 
DNA polymerase a (type B). The role of polymerase delta in 
replication however is not clear although there is evidence 
that it participates in post-replicational DNA repair. 

Replicative DNA polymerases from some bacteriophages (T3, 
T5 and T7) and eukaryotic mitochondrial DNA polymerases 
have homology to bacterial polymerases I and are therefore 



type A polymerases. Eukaryote replicative polymerase a, 
archaebacterial DNA polymerases, viral DNA polymerases, 
DNA polymerases encoded in mitochondrial plasmids of 
various fungi and plants and some bacteriophage poly- 
merases (T4 and RB69) all belong to type B. The bacterial 
DNA polymerase III class, members of which are responsible 
for most of the replicative DNA synthesis in bacteria, are type 
C DNA polymerases [1]. The three types share no obvious 
sequence similarity, but types A and B are structurally similar 
to each other (see below). In general, a single gene for each 
type is found in the different organisms, but there are excep- 
tions; for example, some bacteria have several genes for type 
C DNA polymerases. Little is known about the structure of 
the DNA polymerase a gene in eukaryotes, although the 
mouse gene is known to contain four exons. 

Characteristic structural features 
Sequence features 

Type A polymerases contain three conserved motifs: A, B 
(Prosite signature PS00447 [3]) and C. Motifs A and C 
(Figure 1) are part of the catalytic site, whereas motif B is 
involved in the binding of dNTPs. In type B polymerases, up 
to six regions of sequence homology have been identified. 
Regions I (Prosite signature PS00116) and II form part of the 
active site and are considered to be equivalent to polymerase 
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type A motifs C and A, respectively (Figure i). In these 
regions, metal-ion-binding aspartic acid residues are in 
equivalent structural positions [4,5]. 

In addition to the catalytic domain, DNA polymerases often 
have additional domains required for editing activity, exci- 
sion of Okazaki primers during replication (structure- 
specific 5' nuclease activity), or for interactions with other 
proteins. Bacterial and archaebacterial replicative DNA poly- 
merases can contain a 3'-to-5* proofreading exonuclease 
domain, and the eukaryotic a polymerase catalytic subunit 
contains a zinc finger domain required for interactions with 
other summits of the polymerase-primase complex [6]. A 
35-residue fragment with the potential to form a Zn-finger 
has also been identified in type C DNA polymerase from bac- 
teria. This region appears essential for the proper formation 
and/or function of the enzyme's polymerase site [7]. 

Structural features 

The E. coli DNA polymerase I structure was detemined in 
1985 [8], and since then, several other type A DNA polymerase 



structures, such as T7 DNA polymerase [9] (Figure 2a), have 
been solved. For type B DNA polymerases three crystal struc- 
tures are available, from phage RB69 [10] and from two Ther- 
mococcus species of archaebacteria (Figure 2b) [11,12]. 

The catalytic domains of type A and B DNA polymerases 
have a common overall architecture, which resembles a 
right hand and consists of 'thumb', 'palm* and 'fingers' 
domains [4] (Figure 2a,b). A similar structure and catalytic 
mechanism is shared by other families of polymerases, 
such as eukaryotic DNA polymerase p, reverse transcrip- 
tases and RNA-dependent RNA polymerases. The most 
conserved region is the palm domain, which contains the 
catalytic site. The fingers and thumb have somewhat differ- 
ent arrangements in the two families, although the thumb 
always contains parallel or anti-parallel a helices that 
appear to interact with the minor groove of the primer- 
template complex, and the fingers have an a helix with 
conserved sidechains positioned at the blunt end of the 
primer-template complex. 



Type A, motif A 



Type A, motif C 



Type B, motif II 




ype B, motif I 




Figure I 

Logo representation of DNA polymerase domains from 
representative polymerase sequences. The sequences were 
extracted from SwissProt links in Prosite [3] to the DNA 
polymerase type A (PS00447) and DNA polymerase type B 
(PSOO 1 1 6) families. The families were initially found using 
*DNA polymerase* as keyword. Sequence logos show the 
relative representation of the different amino acids at each 
sequence position in terms of the size of the appropriate 
letter in the single-letter amino acid code; the largest letters 
are the most conserved. They were constructed using the 
online server WebLogo [17]. Type A DNA polymerases (10 
sequences) were from £ coli (polA), Bacillus subtilis, Thermus 
aauaticus, Aquifex aeolicus, Mycobacterium tuberculosis, phage 
T3, phage T5, phage T7, mitochondrial Saccharomyces 
cerevisiae DNA polymerase and mitochondrial Homo sapiens 
DNA polymerases. Type B DNA polymerases (15 
sequences) were from H. sapiens (a and 5), S. cerevisiae (a 
and 5), Drosophila melanogaster (a), phage T4, phage RB69, 
Chtorella virus NY-2A, Zeo moys plasm id S- 1 , herpes simplex 
virus I , vaccinia virus, African swine fever virus, 
Archaeoglobus fulgidus, Pyrococcus horikoshih £ coli (polB). 



Localization and function 

The replicative polymerases are required for the faithful 
replication of the genetic material and they perform this role 
by attaching the appropiate nucleotide to the nascent strand 
to match the template strand. The replication machinery, 
which includes a number of other well-conserved enzymes 
such as helicases and primases, assembles at the replication 
origin, where the replicative DNA polymerase initiates DNA 
synthesis using short DNA or RNA primers. All known DNA 
polymerases synthesize DNA in a 5'-to-3' direction. 

The replication of DNA occurs before cell division. Replica- 
tive DNA polymerase genes are housekeeping genes and in 
eukaryotes the enzyme is found in all nuclei. The polymerase 
activity can be controlled in a cell-cycle-dependent manner 
or in response to different environmental conditions, 
however. For example, in mammals the activity of replicative 
DNA polymerase a is regulated by phosphorylation during 
the cell cycle [13] and its expression is stimulated by specific 
transcription factors during growth [14]. 



Enzyme mechanism 

DNA synthesis is mediated by transfer of a phosphoryl group 
from the incoming dNTP to the DNA 3' OH, liberating a 
pyrophosphate and forming a new DNA phosphodiester 
bond. This reaction is catalysed by a mechanism that 
involves two metal ions, normally Mg 2+ , with the participa- 
tion of two aspartic acid residues that are structurally con- 
served among the different enzymes [5,6]. These are the 
aspartic acids conserved in motifs A and II, and C and I, 
from type A and type B polymerases, respectively (Figure 1). 
The first metal ion activates the 3-OH for attack of the a 
phosphate of the incoming dNTP and the second metal ion 
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Figure 2 

DNA polymerase protein structures. The figures were generated with the free software Rasmol (RasWin Molecular 
Graphics, Windows Version 2.4) and files extracted from the Protein Data Bank (PDB) [18]. (a) Bacteriophage 77 DNA 
polymerase (type A; PDB code: It7p) [9]. Blue, catalytic domain; green, 3'-to-5' exonuclease domain, (b) Thermococcus 
gorgorianus type B DNA polymerase [II] (PDB code: Itgo). Blue, catalytic domain; green, 3'-to-5' exonuclease domain; 
orange, amino-terminal domain; yellow, linker region. 



stabilizes the negative charge that builds up on the leaving 
oxygen and chelates the p- and y-phosphates. 

The catalytic mechanism of the type C DNA polymerase cat- 
alytic subunit may be similar to type A and B polymerases. 
Three conserved aspartic acid residues have been identified 
that strongly affect the polymerase activity and that may 
therefore be involved in the coordination of metal ions in the 
active site [15]. 

Frontiers 

Replication in prokaryotes and eukaryotes is a relatively well 
studied pathway and a number of solved type A and type B 
DNA polymerase structures have provided evidence of a 
common architecture and active-site mechanism. Similar 
features have been observed in other non-renlicative DNA 
polymerases. The determination of structures of type C DNA 
polymerases will be interesting, as it may show whether all 
polymerases can be unified as a single structural class. 

An interesting aspect of replicative DNA polymerases is the 
diversity of enzymes, covering at least three different 
sequence families, that perform the same function. DNA 
replication is an ancestral function and as such is expected to 
be well conserved. The presence of different types of func- 
tionally homologous polymerases in different organisms sug- 
gests a complex evolution, however. One hypothesis, which 
involves transfer of DNA polymerase genes from viruses to 



eukaryotic genomes, has recently been put forward to 
attempt to address this question [16]. There is clearly much 
still to be learnt about these important enzymes. 
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New levels in the understanding of DNA replication have 
been achieved from recent crystal structure determinations of 
several DNA polymerases and their substrate complexes. The 
structure of an a family DNA polymerase from bacteriophage 
RB69 shows some similarities, but also considerable 
differences in structure and organization from the pol I family 
DNA polymerases. Also, the functions of three polymerase 
domains and their conserved residues have been clarified 
by studying structures of pol I family DNA polymerases 
complexed to their substrates. These structures also confirm 
that an identical two-metal ion catalytic mechanism proposed 
previously is used by both the nonhomologous po! I and pol |3 
family DNA polymerases. 
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pol a DNA polymerase a 

pol P DNA polymerase p 

RT reverse transcriptase 

T7 pol 17 DNA polymerase 

Tag pol Tag DNA polymerase 



Introduction 

Significant advances in the structural biology of DNA 
polymerases have been made on two fronts. First, the 
library of known polymerase structures has expanded to 
include the gene 43 protein (gp43) from bacteriophage 
RB69 [1"L a member of the human DNA polymerase a 
family, offering insights into eukaryotic replicative poly- 
merases. This expanded library also includes structures of 
polymerases from bacteriophage T7 [2 ## ] and a Bacillus 
stcarothermophilus strain [3*1 — examples of polymerases 
homologous to DNA polymerase I from Escherichia coli. 
On the second front, a combination of structural and 
biochemical techniques has expanded our understanding 
of several properties of DNA polymerases, such as 
substrate binding [4" ) 5 # ,6 M ], processivity [2",6**], fidelity 
[2",7*], and nucleotidyl transfer [2**]. Although DNA 
polymerases often occur as multifunctional polypeptides, 
we shall restrict the scope of this review to the polymerase 
domains. 



The structure of gp43 from bacteriophage 
RB69 

The gp43 protein from bacteriophage RB69, which is 
a close homologue of bacteriophage T4, is primarily 
responsible for the replication of the viral genome [8]. 
It shares both sequence and functional homology with 
human DNA polymerase a (pol a), an enzyme implicated 
the replication of chromosomal DNA. The gp43 protein 
contains the six conserved sequence hallmarks, known 
as regions I-VI, that are common to all pol cx-like or 
*B family* polymerases (9). Functionally, it shares several 
attributes with these polymerases: it is responsible for both 
leading- and lagging-strand synthesis, it is tethered to the 
substrate DNA by a . protein processivity factor and it is 
stimulated by a single-stranded DNA binding protein [8]. 
Thus, knowledge gained from the study of gp43 will be 
applicable to eukaryotic replicative DNA polymerases. 

The crystal structure of RB69 gp43 at 2.8 A resolution [!»•] 
reveals some surprising features along with some known 
structural motifs. The overall structure of the 103*kDa 
polypeptide is that of a disc with a small hole in the 
center (Figure 1). Arrayed around this centra! cavity are 
five structurally and functionally distinct domains that 
form three clefts, termed cleft D, cleft T, and the editing 
channel (Figure la). Approximately one-half of the disc is 
formed by the polymerase portion of gp43, which adopts 
the right hand shape common to all known polymerase 
structures. It comprises three domains: the fingers, palm 
and thumb (Figure lb). The catalytic center of the 
polymerase is located on the palm domain, near the central 
cavity. The disc is completed by an N-terminal domain 
ana* the 3'-5' exonuclcase domain, which is homologous to 
the Klenow fragment (KF) editing domain. The structure 
of this half of the disc is nearly identical to that of 
the homologous KF domain from bacteriophage T4 gp43 
determined by Wang et al. [10]. 

One unexpected feature of the RB69 gp43 structure is 
the position of the proof-reading domain with respect 
to the polymerase domain, which is opposite to their 
arrangement in the DNA polymerase I (pol I) family poly- 
merases. Also, unlike the pol I family, a single-stranded 
tetranucleotide binds in the editing channel so that no 
translation of the primer-template duplex is required for 
the primer terminus to bind either in this channel or at 
the polymerase active site. In Tag DNA polymerase (Tag 
pol), the primer-template duplex bound to the polymerase 
active site is translocated relative to the frayed duplex 
DNA bound with its primer terminus in the KF-like 
proof-reading site [4",11]. As is generally the case for the 
pol a family, the editing activity of this phage polymerase 



This Document has been provided by Technical Information Service of Purdue University and is copyright protected. 



DNA polymerases Brautigam and Steitz 55 



Figure 1 




The structure of gp43, the a family DNA polymerase from phage RB69. (a) A surface representation of the protein with homology modeled 
DNA. The polymerase half of the enzyme is on the right, whereas the exonuclease portion is on the left. The three clefts are labeled. The DNA :n 
the structure has been modeled [1"] by comparison with the Taq pot protein-DNA co-crystal structure. The duplex product Ms into cleft D wnile 
the modeled 5' template was placed in cleft T. The central cavity is obscured by the DNA model, (b) A ribbon representation of the polymerase 
domain of gp43. The structure is color-coded by domain, with the fingers in the darkest gray, the thumb in the lightest gray, and the paim in 
an intermediate shade. The palm is dominated by an antiparallel ji sheet. The thumb is mainly a-helical. but also contains some 3 strands. The 
unusual fingers feature two very long a helices, (c) Schematic representation showing the similar overall topologies of the palm domains of gp43 
and Taq pol. Both comprise an antiparallel p sheet flanked on the noncatalytic side by two a helices. The positions of the conserved catalytic 
aspartates are shown as open circles, (d) A model of DNA and accessory proteins bound to gp43 [1"]. The DNA is modeled as in (a). The 
clamp protein, gp45, is proposed to interact with both the DNA and gp43 at the exit point of deft D. The singie-stranded DNA-bmdmg prote n. 
gp32, is proposed to interact with the single-stranded template after it 'eaves cleft T. Reproduced with permission from [1 ••]. 



is about 1000-fold higher than che exonucleolytie activity 
of the £. co/i pol I enzyme [12-161. The structure of a 
single-stranded DNA substrate bound to the exonuclcase 
active site of gp43 shows :hc e-amino group of a lysine 



residue in close proximity to che DNA scissiie phosphate 
[I •MO). This lysine is conserved among the pol a family, 
but is not present in the pol I family. The £-amino group ot 
this residue may enhance the exonuciease activity of g?43 
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by stabilizing the pentacovalent transition state through 
an interaction with a negatively charged, non-bridging 
phosphate oxygen. 

A comparison of the polymerase motif of gp43 with the 
analogous motif in a binary complex between Tag pol 
[4-] and double-stranded DNA shows that the topology 
of the palm domains is identical and thus, by homology 
modeling, suggests the direction of DNA replication when 
bound to gp43. The structures of the P-sheet portions 
of the gp43 and Tag pol palm domains arc very similar 
(Figure 1c). By superimposing the homologous palm 
domains of the binary complex of Tag pol and gp43 
and by assuming that the two polymerases bind DNA 
similarly, a model for primer-template bound to gp43 was 
built [l"]. It positions the double-stranded portion of the 
DNA substrate at cleft D (Figure Id). Furthermore, cleft 
T is in a suitable position to bind the single-stranded 
DNA template. This model of DNA bound to these clefts 
allowed Wang et aL [!"] to surmise the positions of the 
accessory proteins of gp43 within a functioning repiisome 
(Figure Id). The processivicy factor of RB69, gp45, like 
that of the T4 phage, is a sliding-clamp protein whose 
donut-likc structure is homologous to that of the p subunit 
of DNA polymerase III from E. colt Q Kuriyan, personal 
communication; see also [17]). It is proposed that gp45 
interacts with the long C-terminal tail of gp43 as well 
as the double-stranded DNA on the polymerase side of 
the disc, at the exit point of cleft D [1"L This model is 
supported by the fact that removal of the C-terminal tail 
of gp43 eliminates gp45 -gene rated processivity [18]. The 
single-stranded DNA-binding protein from RB69, gp32, 
probably associates with gp43 on the exonuclease side of 
the disc, since cleft T exits the polymerase there. Thus, 
taken in the context of other polymerase structures, the 
structure of gp43 allows a reasonable model of a portion 
of the cukaryotic replication apparatus to be inferred. 

The structure of gp43 also clarifies some aspects of the 
interrclatedncss of pol a family polymerases and the 
relationships between this family and other polymerase 
families [!••]. Using the gp43 structure, the sequence 
alignments of all pol a family polymerases can be 
extended beyond regions I-VI to a span of nearly 200 
residues encompassing all six of the previously identified 
sequence hallmarks (Figure 2a). An earlier attempt [19] 
to align the sequences of all DNA and RNA polymerases 
resulted in the conclusion that the active-site motif DTD 
(residues 621-623) of pol a was equivalent to a DD motif 
(residues 184-185) in HIV-1 reverse transcriptase (RT) or a 
DE motif (residues 882-883) in KF. The threonine of the 
pol a sequence was aligned as an insertion between the 
two conserved acidic residues. The structures of gp43, KF 
and HIV-1 RT [20], however, show that these two acidic 
residues are in similar positions only in KF and HIV-1 RT. 
Across all polymerases, only the second D of the DTD 
motif and the first D of the DD(E) motif are structurally 
conserved [!••] (Figure 2b). 



Other polymerase structures 

The crystal structures of two homologues of DNA 
polymerase I from £. colt have been solved recently. The 
first, a Klenow-like fragment of the DNA polymerase I of a 
thermophilic Bacillus species [3-], has been solved at 2.1 A 
resolution. The structure of this enzyme is very similar 
to Klenow fragment [21], especially in its polymerase 
domain. The second structure, a quaternary complex of T7 
DNA polymerase (T7 pol), thioredoxin (a processivity fac- 
tor), DNA, and incoming dideoxynucleoside triphosphate 
was solved to 2.2A [2-]. The complex shows that the 
three-dimensional structure of T7 DNA polymerase is also 
analogous to that of KF. 

Structural comparisons among DNA 
polymerases 

A comparison of all DNA polymerase structures elucidated 
to date reveals some interesting commonalities and some 
striking differences. Presently, the structures of enzymes 
from four DNA polymerase families [9] have been 
established: RB69 gp43 from the pol a or 4 B family' 
U-]; KF [21], Tag pol [22], T7 pol [2«], and both the 
Bacillus and Thcrmus aguaticus KF analogues [3*,23] from 
the pol I family; rat and human DNA polymerase p (pol 
P) [S',24,25] from the terminal transferase family (26]; and 
HIV-1 RT [20,27-29] and a fragment of Moloney murine 
leukemia virus RT [30] from the reverse transcriptase 
family. The palm domains of all DNA polymerases consist 
of a four- to six-stranded p sheet that is flanked on one 
side by two a helices. In the pol 1, pol a and reverse 
transcriptase families, the topologies of the palm domain 
arc the same. Even though the palm domain of pol p 
is mostly p sheet, its topology differs from that of the 
other polymerase families and thus was not derived from a 
common evolutionary ancestor. Beyond the palm domains, 
structural comparisons break down. The structures of the 
fingers domains vary widely from family to family, and, 
although the thumb domains are mainly ct-helical, the 
detailed structures of these domains arc also not related. 
Perhaps surprisingly, the fingers and thumb domains in all 
four families have arisen from different ancestors. 

Although there are structural differences among the fam- 
ilies of DNA polymerases, complexes of these enzymes 
with primer-template DNA show that the anatomical 
features of these enzymes play analogous roles across fam- 
ily boundaries, often using similarly oriented secondary 
structural elements. This can be illustrated by viewing 
a binary complex of blunt-ended double-stranded DNA 
and Tag pol, an enzyme whose polymerase motif is a 
close homologuc of the polymerase motif of KF (Figure 
3a) [4",22]. The primer terminus of the DNA abuts the 
fingers domain of Tag pol and several acidic residues from 
the palm domain that are responsible for binding the 
catalytically essential metal ions arc located close to the 
primer. The DNA leads away from the fingers and has 
several contacts with the palm domain. It then encounters 
the thumb domain, which has extensive contacts with 



This Document has been provided by Technical Information Service of Purdue University and is copyright protected. 



DNA polymerases Brautigam and Steitz 57 



Figure 2 
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Sequence comparisons among DNA polymerases, (a) A comparison within the pol a family. Included in this comparison are polymerases 
from RB69 and T4 (rb69 and t4>, herpes simplex virus 1 (hsv1) t £ coA" (pol !l, [epolb]}, Pyrococcus furiosis (vent), humans (pol a. [hpoia]), 
and yeast (pol l f [ypoll J). Arrows and cylinders above the alignment denote secondary structure, and regions I— VI are shown below. Capital 
letters are consensus residues. A consensus sequence is shown above the alignment. Residues shown in the darkest shade are conserved 
carboxylates, medium gray residues are solvent-exposed residues that may be involved in substrate binding, and lightest gray are positively 
charged residues. Boxed residues are carboxylates whose roles are unclear from the gp43 structure, (b) Sequence conservation among all 
DNA polymerases. In addition to some of the polymerases enumerated in (a), those from phage T5 (t5), Merhanococcos jannaschii (mjan), 
mitochondrial Chlamydomonas rheinhardtii DNA intron (rtchla), sendai virus L (sendai), HIV-1 (HIV-1 RT), polio virus (polio), £ coli (KF), 
Thermus aquaticus (taq) t and T7 RNA polymerase (T7 RNAP) are shown. Region II of the pol a family polymerases corresponds to motif A of 
[19], while region I corresponds to motif B. This sequence alignment is based on the conserved positions of carboxylates in the palm domain. 
Only two carboxylates are completely conserved across all of these polymerases (shown in boxes). The first D of the DTD motif and the second 
carboxylate of the DD(E) motif (arrows) are not structurally equivalent. Reproduced with permission from [V]. 
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the DNA across its minor groove. Whereas che above 
statements specifically concern Tag pol, they also hold 
true for T7 pol ([2"]; Figure 4a), HIV-1 RT ([27]; Figure 
3b) and also for the homology model-built DNA complex 
with RB69 gp43 ([1**]; Figure 3c). The ternary complex 
of pol p with DNA and incoming dideoxynuclcoside 
triphosphate (ddNTP) is shown in Figure 3d. Here, the 
primer-template DNA abuts domain D, and domain B 
contacts the DNA across the minor groove. Based on this 
functional analogy, it is clear that domains D and B would 
most usefully be called fingers and thumb, respectively. 
Wc shall follow this nomenclature throughout this review, 
even though a previous inappropriate alignment of pol p 
with non-homologous polymerases has led to an opposite, 
functionally less useful naming of these domains [25]. 
Mewed with this assignment in mind, all of the statements 
made above regarding Tag pol arc true for pol p and an 
additional similarity among the pol a, pol I, and terminal 
transferase families becomes evident. They all have an 
a helix located in the fingers domain near the primer 
terminus that houses residues responsible for nucleotide 
binding. No analogous a helix is present in the reverse 
transcriptases, but rather there is an anti-parallel P hairpin 
located at the primer terminus. 

The enzyme-substrate complexes of Tag pol [4* # ] and 
T7 pol [2"], along with the changed assignment of 
pol p, allow us to resolve a controversy that exists in 
the field [31-33]. A structure of an editing complex 
between KF and DNA shows the double-stranded portion 
of the substrate bound between the thumb and the 
3-5' exonuclease domains [11]. This led Steitz and 
colleagues to propose that when the DNA was bound 
at the polymerase active site, it occupies roughly the 
same position as in Tag pol (Figure 3a). However, the 
alignment of KF and pol P resulting from the previous 
inappropriate superposition of their non-homologous palm 
regions spawned a proposal that the QNA should be bound 
to KF in the opposite orientation [25]. The structures of 
Tag pol and T7 pol demonstrate unequivocally that the 
proposal of Steitz and coworkers is correct and that the 
latter proposal should be disregarded. 

Structural and functional insights into the 
activities of DNA polymerases 

The quaternary complex of T7 pol [2**], the structure of 
Tag pol bound to DNA [4**] and several new structures 
of pol P [5*,7*] are remarkable for the insight they give 
into the processivity, fidelity, catalytic mechanism and 
conformational flexibility of DNA polymerases. 

Processivity 

DNA polymerases have the unique feature that they 
may remain associated with the substrate primer-template 
over many catalytic cycles, that is, they are proccssive 
[8]. Replicative DNA polymerases need the aid of a 
protein factor in order to remain processive over the many 
thousands of nucleotide additions required for genome 



replication. T7 pol, a viral replicative DNA polymerase, 
commandeers the £. cols protein thiorcdoxin for this 
purpose [34]. The crystal structure of the quaternary 
complex of T7 pol and its substrates includes thioredoxin 
([2**]; Figure 4a). The factor binds to a loop that protrudes 
from the tip of the polymerase thumb. Although electron 
density for DNA does not appear near thioredoxin due to 
disorder, extrapolation of the visible DNA helix positions 
the primer-template for a potential interaction with the 
factor (Figure 4a). Since all kinetic schemes for DNA 
polymerases include a step in the catalytic cycle during 
which the DNA is susceptible to dissociation from the 
enzyme [13,16,35,36], thioredoxin or other processivity 
factors act at this stage, either sterically or electrostatically 
hindering the dissociation of the DNA. 

The thumb domain has been implicated in processivity 
in two other polymerases. When viewed along the bound 
duplex DNA, KF is seen to surround the DNA on three 
sides with the tip of the thumb on top [11], Deleting the 
tip of the KF thumb reduces its processivity about fourfold 
[6**]. Furthermore, a thioredoxin-sensitive increase in 
processivity can be conferred on KF by grafting the 
thioredoxin-binding loop from T7 pol onto the tip of the 
KF thumb [37**]. In pol P, deletion of an 8kDa extension 
from the tip of its thumb likewise reduces its processivity 
[38]. Also, a metal ion mediates interactions between the 
pol P thumb and the substrate DNA, possibly discouraging 
the DNA from dissociating during the enzyme's catalytic 
cycle [5*]. Thus, in these three examples (and perhaps 
in all polymerases), the thumb plays an important role in 
maintaining the polymerase's grip on its DNA substrates. 

Fidelity 

An enzyme's fidelity, or its ability to incorporate the correct 
substrate from among a sea of potential imitators, is a 
general problem in enzymology. The problem is magnified 
with DNA polymerases. These enzymes must choose from 
a pool of four deoxynucleoside triphosphates (dNTPs) 
and this choice is directed by yet another substrate (the 
single-stranded template). Furthermore, the consequences 
of an incorrect choice may be catastrophic to the organism. 
Thus, fidelity is of central importance in the reaction 
catalyzed by DNA polymerases. It is therefore surprising 
that in the T7 DNA polymerase quaternary structure [2**] 
there arc no hydrogen bonds between the protein and the 
bases of the incipient base pair formed by the template 
and incoming nucleotide. Instead, it seems that the basis 
for error discrimination at this juncture is the steric 
complementarity between the protein and a correctly 
formed Watson-Crick base pair. This is in contrast to the 
pol p ternary structure [25], in which polymerase residues 
hydrogen bond to the nascent base pair. Amino acids 
from the palm of the T7 pol do, however, interact with 
base pairs near the 3' terminus of the primer, suggesting 
that these residues are scanning for newly-made errors in 
polymerization. Since mutation of these residues results in 
a lower affinity of the enzyme for DNA [39,40], they could 
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Figure 3 




A comparison of primer-template DNA bound to four DNA polymerases. The complexes shown in (a), (b), and (d) are co-crystal structures, 
whereas the complex in part (c) is a homology model [1 *"]. These four structures have been aligned by the first two base pairs at the primer 
terminus. The fingers of these structures are colored blue, the palms are red, and the thumbs are green. Secondary structural elements at the 
primer terminus that help bind nucleotide are shown in cyan and those that contact DNA are yellow. The primer strand is shown in orange, while 
the template strand is brown, (a) Taq pol bound to DNA [4-]. As with the other three structures shown in this figure, the DNA stacks against 
the fingers and is contacted across the minor groove by the thumb domain, (b) The binary complex of WV-1 RT and DNA [27]. This structure 
does not have a nucleotide-binding a helix in the fingers domains. Instead, a p hairpin probably performs this function, (c) The model of DNA 
bound to RB69 gp43 [1 ••]. A likely DNA-binding a helix has been highlighted, tt appears that the thumb domain would have to move toward 
the primer terminus in order to bind DNA analogously to the other polymerases, (d) The ternary complex of rat pol p with DNA and ddNTP 
[25]. Domain D (purple) plays the role of the fingers and presents an a helix at the primer terminus. Domain B is analogous to other polymerase 
thumb domains and binds the minor groove of the duplex substrate. 



be involved in shuttling newly incorporated mismatched 
nucleotides to the 3-5' exonucleolytic site of T7 pol [2 M J. 

Several recently determined structures of human pol P 
complexcd with substrate DNA also address the issue 
of fidelity. First, polymerases are more error-prone when 
the native, catalytic Mg2+ is replaced by Mn2+ [41-43). 
Crystalline pol p is capable of adding a nucleotide, 
untcmplated, onto a blunt-ended DNA in the presence 



of Mn 2 + (or other mutagenic metal ions), but not in 
the presence of Mg 2+ [7*]. This implies that mutagenic 
cations, perhaps owing to their increased affinity for 
carboxylates, cause mistakes by stabilizing non-native 
primcr-template-nucleotidc combinations. Also, observa- 
tions of the interactions of pol p with DNA in the crystal 
have led to a proposal for how this polymerase binds at 
the single-stranded DNA gap 15*] that is presumably its 
native substrate. The position of the primer-template in 



This Document has been provided by Technical Information Service of Purdue University and is copyright protected. 



Protein-nucleic acid interactions 





Current Opinion in Structural Biology 



Nucleotidyl transfer in the T7 pol co-crystal structure, (a) Structure of T7 DNA polymerase complexed with thioredoxin, DNA, and incoming 
ddNTP. The primer is depicted in magenta and the template is in yellow. White bases denote the portion of the DNA that has been modeled. 
Purple cylinders represent polymerase a helices and the red cylinders are a helices from thioredoxin. The DNA abuts the fingers at the primer 
terminus and the thumb contacts DNA across the minor groove. <b) Superposition of the active sites of T7 pol and pol p using only the DNAs 
from the T7 quaternary structure and the pol p ternary structure. For T7 pol, KF numbering (yellow) is used and the pol p residues are labeled in 
white. The figure shows that the essential carboxylate residues and metal ions are in a similar 3D arrangement, despite the obvious disagreement 
in the direction of the P strands that make up the palm domains, (c) Active site of T7 pol shown using KF numbering, with the T7 pol numbers 
shown below in parentheses. Metal ions A and B are contacted by two protein residues [D88 2(654) and D705(475)], the phosphates of the 
ddNTP and two waters. The putative position of the primer's 3'-OH, which would contact metal ion A, is shown by a red star. Figures provided 
by Tom Ellenberger. (d) Mechanism of nucleotidyl transfer in T7 pot, using the KF numbering system. The active site features two metal ions 
that stabilize the resulting pentacoordinated transition state. Metal ion A activates the primer's 3'-OH for attack on the a-phosphate of the dNTP. 
Metal ion B plays the dual role of stabilzing the negative charge that builds up on the leaving oxygen, and chelating the p- and -/-phosphates. 
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this model is the same as in the ternary complexes, with 
the single-stranded template making a sharp turn so that 
the next double-stranded portion of the DNA 

may interact with the 8kDa domain at the tip of the 
thumb. Kraut and colleagues [5*] postulate that such a 
kink in the template has advantages for fidelity, as it 
would minimize non-specific stacking contacts between 
the incoming dNTP and the primer; the template strand 
is also sharply bent in the T7 quaternary complex [2 m9 ]. 

Nucleotidyl transfer 

The crystal structures of T7 pol [2"J and of pol P [25] 
complcxcd with DNA and next correct dNTP show that 
these non-homologous enzymes use an identical catalytic 
mechanism to transfer a nucleotide onto the 3' end of a 
DNA primer. The active sites of these unrelated poly- 
merases share many essential and common features. When 
Ellenbcrger and colleagues [2**] aligned the two structures 
by superimposing their DNA substrates, the catalytically 
essential carboxylates from these disparate structures also 
superimposed (Figure 4b), as do the two metal ions 
bound to the triphosphate moiety in both structures. 
Although the terminal transferase family (to which pol 
P belongs) has evolved separately from other polymerase 
families and has a different catalytic domain topology, it 
shares a common three-dimensional arrangement of critical 
catalytic components. The structure and positions of the 
primer-template, dNTP and divalent metal ions are nearly 
identical in T7 pol and pol p. 

The structure of the T7 pol quaternary complex provides a 
basis for understanding the roles of certain residues known 
from other studies to be essential for nucleotidyl transfer 
([2**l; Figure 4c,4d). For the following discussion, the KF 
numbering scheme is used. Two conserved carboxylates, 
D705 and D882, are bridging ligands to the two catalytic 
metal ions. Significantly, these arc the only two carboxylate 
residues that are absolutely conserved between the pol 
a, pol I and reverse transcriptase families ([!••]; Figure 
2b). E883, which is conserved in the pol I family, docs 
not contact the metal ions, as is consistent with its 
absence in the pol a family. The two cations have been 
dubbed A and B. The ligation environment of metal 
ion A is completed by two water molecules and the 
pro-R oxygen of the nucleotide's a-phosphate. This cation 
would also be contacted by the 3'-OH of the primer, if 
it were present (a dideoxy-tcrminated primer was utilized 
in the crystallization experiment). This hydroxy! group 
would be perfectly positioned to undertake an in-line 
attack on the nucleotide's a-phosphate. Metal ion B is 
contacted by oxygens from all three of the nucleotide's 
phosphates, as well as a main-chain oxygen. Curiously 
the metal ion-ligand distances in the active site of T7 
pol are somewhat longer than expected (Figure 4c). 
Nevertheless, the features of this Michaelis complex make 
u clear that a proposed two metal ion mechanism for 
DNA polymerases [44], which was based on an analogy 



to the 3'-5' exonucleolytic two metal ion mechanism 
of KF [45], is essentially correct (Figure 4d). In this 
polymerase mechanism, metal ion A activates the 3'-OH of 
the primer for attack by lowering its pK a . This nuclcophilic 
attack at the a-phosphate generates a pentacoordinated 
intermediate, whose structure would be stabilized by both 
metal ions. A build-up of negative charge on the leaving 
oxygen of the P-phosphate could be stabilized by metal 
ion B. Once the reaction is complete, the pyrophosphate 
product and the metal ions dissociate, and the DNA must 
translate and rotate relative to the polymerase so that 
the newly formed primer terminus is correctly positioned 
in the primer-binding site for another catalytic cycle. 
The only significant deviation from the earlier proposal 
is that metal ion B appears to be playing the dual role 
of transition-state stabilization and chelation of both the 
P- and y-phosphates. 



Figure 5 
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Finger domain movement in DNA polymerases. The palm domains 
of the binary [5 # ] and ternary [27] complexes of pol p were used to 
superimpose the two protein structures. The binary (protein and DNA) 
form of the polymerase is shown in the darker shade and only the 
fingers domain of the ternary (protein and DNA and dNTP) form is 
shown in the lighter shade. The fingers dose down in the presence of 
the nucleotide. 



Conformational changes 

The binding of dNTP and duplex DNA to both pol I 
family and pol P family DNA polymerases produces large 
changes in the orientations of the fingers and the thumb 
domains relative to the palm domain. A comparison of 
the binary- complexes of polymerases and DNA with 
ternary complexes containing an incoming dNTP shows 
that the fingers close down on the DNA upon binding 
of the nucleotide in both pol p ([5*\; Figure 5) and 
pol-Mike enzymes [2**,4"]. A comparison of the Taq and 
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T7 pol complexes shows a few angstroms difference in the 
position of the primer terminus in these two complexes. 
Although it is unclear whether these conformational 
differences arc pan of the polymerase catalytic cycle or 
arc influenced by crystallographic artifacts, they provide a 
rational explanation for the kinetic observation [36,46-48] 
that the rate-limiting step in the polymerase reaction 
pathway occurs after the binding of dNTP [2",5 # ]. The 
thumb domains of polymerases also move in response to 
binding DNA [4**, 11,25,49], and this mobility may also 
play a role in the catalytic cycle. 

Conclusions 

Principal among the many new structural and functional 
insights into DNA polymerases obtained during the last 
year have been the structure determinations of a pol 
a family DNA polymerase and a T7 DNA polymerase 
complexed with DNA and nucleotide substrates. The 
structure of RB69 gp43 strongly suggests how the 
replisomcs of this and related polymerases are organized. 
The structures of Taq pol with DNA, pol p with various 
metal ions and substrates, and the quaternary structure 
of T7 DNA polymerase all give a much needed insight 
into the workings of DNA polymerases. These studies, 
taken together with the results of mutagenic experiments, 
suggest the functions of the domains that make up poly- 
merases: the fingers are involved in correctly positioning 
the template and the complementary NTP relative to 
the catalytic metal ions, and undergo a substrate-induced 
conformational change that may be important to catalysis; 
the palm domain harbors two completely conserved 
carboxylate residues that bind the essential metal ions 
and also contains residues that may detect replication 
mistakes; and the thumb domain is important for DNA 
positioning and proccssivity. Additionally, the catalytic 
mechanism of nucleotidyl transfer in the polymerase 
reaction may be universal, involves two metal ions and 
is very similar to that of the proof-reading exonucleases. 
Though the model of RB69 gp43 with DNA and accessory 
proteins is instructive, it is essential to determine the 
crystal structures of this and similar polymerases with their 
substrates and associated factors in order to confirm and 
extend this model. 

Perhaps the most striking findings from these poly- 
merase structures, as anticipated by the limited sequence 
similarity among families, are the differences in the 
structures of the fingers and thumb domains of poly- 
merases from different families. The thumb and fingers 
domain structures are completely unrelated among the 
four DNA polymerase families now known: pol I, pol 
a, pol p and reverse transcriptase. Nevertheless they 
function similarly in all families, using in most cases 
analogous secondary structural elements. Why are these 
domains, so essential to the polymerase reaction, derived 
from different ancestors? It would of interest to establish 
the structure of the bacterial DNA polymerase III, 
which show f s no recognizable sequence similarity to other 



polymerases. It may possess yet another polymerase fold 
and could yield clues to understanding the surprising 
differences among these vital enzymes. 
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Getting a grip: polymerases and their substrate complexes 

Joachim Jager* and Janice D Patat 



Underpinned by a database of more than a dozen different 
crystaJ structures, an increasingly complete and coherent 
picture of polymerase structure and function is emerging. 
Recently determined structures of DNA and RNA polymerases 
have revealed some of the molecular features and structural 
changes governing catalysis, oligomerization, processivity and 
fidelity. Despite having minimal similarities in sequence and 
protein topology, the polymerases all display a functionally 
analogous set of subdomains that bind the primer, template and 
nucleotide substrates in similar though not identical fashions. 
The two-metal-ion mechanism for nucleotide incorporation, 
however, is shared even by nonhomologous polymerases. ' 
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Abbreviations 

3Dpol poliovirus 3D RNA polymerase 
S. stearothermophilus fragment 
ONA polymerase 
double-stranded 
Klenow fragment 
non-nucleoside inhibitor 
Protein Data Bank 
DNA polymerase p 
polymerase I 
RNA polymerase 
reverse transcriptase 



BF 

DNAP 
ds 
KF 
NNI 
PDB 
poip 
poll 
RNAP 
RT 



Introduction 

Replication and transcription arc of vital importance to all 
living organisms. These processes, catalyzed by the polynu- 
cleotide polymerases, must be carried out in a precise and 
efficient manner. The past few years have seen an explo- 
sive growth in the number of polymerase structures 
determined and a corresponding growth in the understand- 
ing of polymerase activities. This review will focus 
primarily on a selected subset of the most recent structures 

%'l ! m !" C i fr ° m cach of che four polymerase classes 
Uab e 1). These structures include three DNA-dependent 
UNA polymerases [1--3-] i n complexes with substrates and 
produ Cts> an RNA-dependent DNA polymerase [4-] and a 
U.NA-dependenc RNA polymerase [5-] bound, respective- 
i>, to RNA and protein inhibitors, and also the apoenzvme 
structure of an RXA-dependcnt RNA polymerase [6'].' 

The crystal structure of the Klenow fragment (KF) of 
txkmckm coll polymerase I (pol I) was the first polymerase 
sructure determined [7]. The overall shape of the polv- 
merase domain was likened to a right hand, with sudomains 



termed fingers, palm and thumb. This analogy has proven 
useful for all polymerase structures solved to date. A com- 
parison with the structure of kanamycin nucleotidyl 
transferase [8,9] shows distinct differences in the palm sub- 
domains, indicating that the structures belong to two 
evolutionary distinct groups [10]. Although we will refer to 
these groups as the polymerase family and the nucleotidyl 
transferase family, both of these evolutionary families 
belong functionally to the nucleotidyl transferase enzyme 
class (Enzyme Commission number 2.7.7.7). With the 
exception of DNA polymerase p (po! p), all the polymerase 
structures determined to date belong to the polymerase 
family. Nucleotidyl transferases are not responsible for 
genome replication and transcription, rather they perform 
more specialized tasks, such as DNA repair and RNA mat- 
uration. 

DNA-dependent DNA polymerases 

The most thoroughly studied polymerases are those of the 
pol I family [11-13]; also referred to as 'family A DNA 
polymerases (DNAPs) [14]. Since the structure of the KF 
was determined in 1985 [7], the structures of three homol- 
ogous enzymes have been solved. These structures, from 
bacteriophage T7 (T7 DNAP [I-]), Thermusjzauatkus (Taq 

SStf fI5 ' I6I) Ba€UiuS ^rothe7mophilus 

I 2 *' 17 ]), show chat the polymerase domains of the pol 
I enzymes are nearly identical to each other (Figure 1). In 
addition to the pol I structures, the important structure of 
gp43 from bacteriophage RB69, a member of the poly- 
merase a family (also referred to as the family B 
polymerases [14]) of rcpiicativc polymerases, has been 
recently determined [18], but will not be discussed here 
as it has been thoroughly reviewed previously [19]. 

Although a structure of KF with DNA duplex located at its 
polymerase active site remains elusive, several such com- 
plexes of other pol I. enzymes have been obtained 
(J able 1). In particular, the high resolution structures of T7 
DNAP and BF complexed with nucleic acids give more 
insights into the structure and mobility of the polymerase 
subdomain, as well as unprecedented structural details 
about substrate and cofactor binding in the pol I famik In 
the 2.2 A structure of T7 DNAR the cnzvmc, bound to its 
processivity factor £. coli thioredoxin, is trapped at a stage 
just prior to phosphoryl transfer. The complex contains 
dideoxy-terminated primer-template DNA, the next 
incoming nucleotide and two magnesium ions. The 1.8 \ 
BF structures show primer-template duplexes before and 
after the incorporation of nucleotides within the crystal. 

The cacalytic core of the polymerase palm subdomain is a 
common folding unit and it appears in manv different 
structural contexts [6-.20.2I J. It consists of a three-strand- 
ed antiparallel (3 sheet flanked bv two a helices. In 
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Table 1 



Recently determined polymerase structures. 



Organism 


Class 


Template 


Size 


£ co// 

T. aquaticus 

T. aquaticus 

fi stf*amthf*rmnnhi/i t<: MM 

8. stearothermophHus (II) 


DNA pol 
DNA pol 
DNA pol 

OKI A nn! 

DNA pol 


DNA 
DNA 
DNA 

nioA 
UlNA 

DNA 


Fragment 

Full 
Fragment 

Full 
Fragment 


RB69 phage 
T7 phage 


DNA pol 
DNA pol 


DNA 
DNA 


Full 
Full 


Human 


DNA NT 


DNA 


Full 


Hl\M 
T7 phage 
£ co// 
Poliovirus 


DNA pol 
RNA pol 
RNA pol 
RNA pol 


RNA or DNA 
DNA 
DNA 
RNA 


Full 
Full 
Fragment 
Partiaf 



Complex formed 



Resolution (A) PDB code Reference 



p-thio-thymidine and zinc 
dsDNA (8/8-mer) 
dCTP 
Apo 

dsDNA{11/9-mer) 
and magnesium 
GMP 

dsDNA (13/10-mer), 
magnesium, ddGTP 
and £ co// thioredoxin 
Gapped dsDNA(16/9/5-mer), 
ddCTP and magnesium 
RNA pseudoknot (33-mer) 
T7 lysozyme 
Apo 

Apo (calcium ions) 



2.1 


1krp,1ksp 


[24] 


2.8 


1tau 


116] 


2.5 


5ktq 


[57] 


2.1 


1bdp 


[17] 


1.8 


2bdp 


[2-] 


2.8 


1waj 


[18] 


2.2 


1t7p 


[1*3 


2.2 


1bpy 


[3-] 


4.8 


Ihuv 


[4-J 


2.8 


1aro 


[5-] 


2.5 


1bdf 


[56] 


2.6 


1rdr 


[6-] 



NT, nucleotidyl transferase. 

polymerases, this region contains the most highly con- 
served residues and is used as a binding platform for the 
DNA primer, dNTPs and the critical divalent metal ions, 
which are bound by a network of charged interactions. In 
the ternary complex of T7 DNAP, for example, metals A 
and B are coordinated by rwo strictly conserved aspartate 
sidechains in sequence motifs A and C [12,18], a carbonyl 
oxygen, two water molecules and by the a, p and y phos- 
phate oxygens of the incoming NTP. A hydroxyl group at 
the 3' end of the primer would be located within 3.1 A of 
the a phosphorous atom and thus would be capable of an 
in-line attack (see Figure 3b in [I*]). Even though a water 
molecule appears to occupy the position of metal A in BF 
polymerase, the active site geometry remains very similar 
to that observed in T7 DNAP. As had been predicted ear- 
Figure 1 



lier [22,23], the two ions thus serve multiple 
purposes: compensation for the negative charges both on 
strictly conserved acidic residues and on the incoming 
NTP; stabilization of the negative charge on the attacking 
3' hydroxyl oxygen atom; and stabilization of the pentaco- 
valent transition state [l"-3\ 19,24]. 

In all recently determined pol I structures, the thumb and 
fingers subdomains are found to be in close contact 
([15,17,25]; J Jager, unpublished data). Thus, the apparent- 
ly large polymerase cleft that had been proposed to be the 
pathway for the growing primer-template [7] is too narrow 
to accommodate duplex DNA. The narrow channel that 
remains between the fingers and thumb presumably 
enables the entry of NTPs and the exit of pyrophosphate 




Stereo superposition of DNA polymerases 
from £ co// (dashed blue lines), 
B. stearothermophHus (red lines) and T7 
bacteriophage (green lines). The overall 
structures of the polymerase domains are 
strikingly well conserved among the different 
members of the pol I subfamily. The sequence 
identities based upon the structural 
alignments shown are only 46% and 31% for 
BF and T7 DNAP, respectively. The a carbons 
of the KF polymerase domain (residues 
545-928) superimpose on equivalent atoms 
in 17 DNAP and BF with root mean square 
deviations^ 1.96 A (261 matching residues) 
and 1 .26 A (333 matching residues), 
respectively. Note the large differences in the 
exonuclease domains (bottom half). 
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Figure 2 




Structure of T7 DNA polymerase complexed with dsDNA and £ coli 
thioredoxin. The polymerase domain is viewed from above into the 
polymerase active site, showing the ^ end of the primer strand. The 
fingers and thumb domains are in close contact The two protrusions on 
the thumb domain, together with the large exonuclease domain (exo), 
almost surround the DNA duplex. The template strand does not enter the 
active site through the cleft between the fingers and thumb domains; 
instead it traverses the fingers domain. The orientation of the 
primer-template is very similar in all poll like enzymes. Furthermore, it 
seems plausible that this orientation is preserved in all other polymerases 
as a result of the conservation of the core of the palm domains. 



after catalysis (Figure 2). In the KF, this channel is lined 
with conserved residues His734, Arg754, Lys758, Tyr766 
and, on the opposite side, conserved residues Lys635 and 
\rg682. The X-ray structures of Taq DNAP, T7 DNAP and 
BF complexed with DNA reveal that the 3' end of the 
primer is positioned at the polymerase active site, with the 
duplex clamped down by the thumb and the 5' end of the 
template stand continuing past the active site and travers- 
ing che fingers. The clamping of the duplex is enhanced by 
the tip of the thumb. This structural feature is found in all 
pol I-like polymerases and may well control the processivi- 
ty of these enzymes, as they usually work without auxiliary 
processivity factors [26]. In T7 DNAP, the thumb extends 
more than 20 A over the nucleic acid backbone, thereby 
surrounding more than 60% of the duplex (Figure 2). 

The junction between the palm and fingers subdomains 
plays a crucial role in the precise formation and recognition 
of canonical Watson-Crick base pairs. Together with the 
terminal base pair of the primer-template, residues from 
helices O and Q in the pol I polymerases form a tightly 
constrained binding pocket that is unsuited for mis- 
matched base pairs Distortions are sensed by a range 
of van der \Vaals\ stacking and hydrogen bond interac- 
tions. The residues involved in mismatch recognition are 
strictly conserved within the pol I subramilv. In KF, these 
residues are Arg668, Tyr766. Giy767. Asn845, Gln849 and 
Lys857. Residues corresponding to those from helix Q in 
KF have been identified even in the RXA-dependent 
R-\A polymerases, following considerable structure-based 



Figure 3 



Steml Stem2 Steml Stem2 
GGGAG/@QcCC^U DuCAGUCGGGAAlA AlACUjS^ 




Interaction of an RNA pseudoknot inhibitor (sequence given at top) 
with HIV-1 RT. The molecular surface rendering of HIV-1 RT indicates 
the contact regions with the RNA. Surface residues less than 5 A 
away from the inhibitor are shown in the lightest gray. Stem S2 at the 
3' end of the pseudoknot forms strong electrostatic interactions with 
residues in the thumb and fingers domains. The 5' end is found in the 
vicinity of the RNaseH active site. Loop L2 (foreground) interacts with 
the p5l connection domain. 



realignment [6\27,28], This conservation may indicate 
that the elaborate mechanisms governing template recog- 
nition and fidelity in pol Mike polymerases may function 
in other polymerase classes as well. 

In T7 DNAP, the formation of the recognition pocket is 
accompanied by a previously unsuspected tightening of the 
protein matrix around the terminal base pair, through the 
rotation of helices 0, 01, 02 and P (fingers) towards the sub- 
strates. Similar motions may occur during the catalytic cycle 
of Taq DNAP and BF, but are not observed in the binary Taq 
DNAP-DNA and BF-DNA co-crystal structures, presum- 
ably because they arc not trapped at the stage of nucleotide 
incorporation. These complexes resemble pre incorporation 
or post-translocation complexes. In contrast to the T7 DNAP 
and polymerase p ([3*], discussed below) co-crystal struc- 
tures, the 3' end of the primer in the BF complex is not 
terminated and. thus, is available for turn-over. Difference 
Fourier maps of BF complexes at near atomic resolution, 
before and after the addition of dNTPs, clearly show that the 
enzyme incorporated the next correct nucleotide. Ultimately, 
time-resolved X-ray crystallography may allow us to follow 
catalysis within the crystalline environment, provided that 
lattice interactions do nor. interfere with the motions of the 
fingers subdomain or che primer-template. 
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Nucleotidyl transferases 

DNA polymerase p (pol P) participates in the mammalian 
base excision DNA repair pathway. Crystal structures of 
rat DNA polymerase P showed that it has two domains — 
an N-terminal 8 kDa domain with DNA lyase activity 
and a C-terminal 31 kDa domain with nucleotidyl trans- 
ferase activity [20,29,30]. Like the pol I polymerase 
domain, the nucleotidyl transferase domain was 
described using analogy to a right hand. For the analogy 
to be useful, however, we have swapped the fingers and 
thumb designations, so that analogous structures serve 
analogous functions [31]. Thus, the thumb subdomain of 
pol P is adjacent to the 8 kDa domain and the fingers sub- 
domain is at the C terminus of the protein. 

The in vivo activity of pol P has been elucidated in three 
recently determined structures [3 # J. These structures are 
of human pol P complexed with either a one-nucleotide 
gapped DNA duplex, a dideoxynucleotide-terminated 
gapped duplex and incoming ddCTP, or nicked DNA 
that would be the result of a single nucleotide incorpora- 
tion into gapped DNA. The structures show that when 
the natural gapped DNA substrate is bound, the 8 kDa 
domain reaches from the thumb subdomain over the 
active site cleft, which is perhaps analogous to the con- 
tact between the fingers and thumb of pol I enzymes. 
The DNA template strand has a 90° kink located to the 
3' side of the template base and the downstream DNA 
duplex traverses the fingers and interacts with the 8 kDa 
domain. When the incoming nucleotide is in place, the 
fingers and 8 kDa domains close down more tightly 
around the DNA, forming the active conformation of the 
catalytic site. Two magnesium ions are found in the 
active site. One ion, metal B, forms tridentate contacts to 
the a, P and y phosphates of the incoming dNTP. With 
respect to the primer-template and dNTP, metal A is 
positioned as in the T7 DNAP structure, in order to acti- 
vate the 3' hydroxyl of the primer strand. The base of the 
incoming dNTP fits into a pocket formed by residues of 
the fingers and the terminal base pair and is positioned 
such that the a phosphate is aligned with the 3' hydrox- 
yl of the primer strand. The protein also forms hydrogen 
bonds and van der Waals' interactions with the minor 
groove of the newly formed base pair. Both the minor 
groove interactions and nucleotide-binding pocket select 
for a correctly formed Watson-Crick base pair. 

RNA-dependent DNA polymerases 

The first structure of an RNA-dependent DNA poly- 
merase, that of HIV-1 reverse transcriptase (RT) 
complexed with the non-nucleoside inhibitor (NNI) 
Nevirapine, was described in 1992 [32] and was subse- 
quently refined to 2.9 A resolution [33]. The 
three-dimensional structure reveals a strikingly asymmetric 
heterodimer consisting of two differently folded subunits 
(molecular weights 66 kDa and 51 kDa). The p66 subunit 
is characterized by a large polymerase cleft composed of 
subdomains named, in analogy to the pol I enzyme family, 



fingers, palm and thumb. A connection domain links the 
polymerase domain to a RNaseH domain. The p51 subunit 
has the same sequence as the N-terminal 440 residues of 
p66 [34], but it lacks the RNaseH domain. High resolution 
crystal structures of RT fragments have also been reported 
by Unge et a I [35] (residues 1-216 of HIV-1 RT, Protein 
Data Bank [PDB] code lhar) and Georgiadis et al [21] 
(residues 10-278 of Moloney murine leukemia virus RT, 
PDB code 1 mml), A comparison of several RT crystal struc- 
tures shows that the protein appears to have a specific 
flexibility that allows the rotation of the polymerase domain 
relative to the rest of the molecule [36]. The observed 
swiveling motion may allow RT to accommodate the rota- 
tional and translational movements of the growing nucleic 
acid duplex, which present an especial problem for RT 
because it uses an asymmetric molecule (tRNA Lys3) as 
the primer for first strand synthesis. 

The crystal form of HIV-1 RT first obtained by Kohlstaedt 
et al. [32] served to elucidate at medium resolution the co- 
crystal structures of other NNI complexes [37]. The report 
of high resolution structures (to 2.2 A) by Stuart and co- 
workers [38,39] was a major breakthrough, however, as it 
provided a very detailed description of the NNI-binding 
pocket. Until then, inaccuracies and the lack of solvent mol- 
ecules near the NNI-binding site in the previously reported 
models had hampered structure-based drug design studies. 

A complex of HIV-1 RT with a double-stranded 
(ds)DNA primer-template was described by Arnold and 
co-workers [40]. The nucleic acid is bound to the large 
polymerase cleft and stretches from the polymerase core 
to the RNaseH active site. The duplex deviates signifi- ; 
cantly from ideal A or B-form. The DNA conformation 
near the polymerase active site more closely resembles 
A-form, whereas after a pronounced kink 6-7 base pairs 
from the polymerase site, it is almost B-form. 

A kink in the nucleic acid was also observed in die recently 
reported structure of HIV-1 RT co-crystallized with an RNA 
pseudoknot [4']. This pseudoknot, a high-affinity aptamer 
for RT that was identified by SELEX [41], competitively ■ 
inhibits the binding of primer-template DNA [42]. The . 
RNA inhibitor is kinked by 60° from straight co-axial stack- 
ing of stems SI and S2. The kink creates a curved shape that 
optimizes the extensive contacts between the RNA 
inhibitor and both subunits of the RT heterodimer^ 
(Figure 3). Through electrostatic interactions of stem S2 and * 
loop LI with several basic residues in both helix I of the p66 
thumb and the p66 fingers subdomains, the RNA stabilizes 
a conformation of the polymerase in which the tip of the 
thumb and the fingers subdomains are in close contact. It is 
interesting to note that RNA stem S2, which is adjacent to 
the polymerase active site, does not interact with the P rib- 
bon (residues 58-77) lining the fingers subdomain. The ; 
binding mode of the pseudoknot is more reminiscent of that ' 
observed in T7 DNAP, in which the template strand tra-- 
verses the fingers. The pseudoknot-binding surface on the * 
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Figure 4 



Schematic stereo drawing of the T7 
RNAP-T7 lysozyme complex. The lysozyme 
mostly interacts with residues from the N* 
terminal domain (orange); the binding site is 
far from the polymerase active site, 
suggesting an indirect mechanism of 
inhibition. The overall structure of this 
enzyme is not reminiscent of those of other 
polymerases, although the pafm and fingers 
domains display striking homology to £ co// 
DNA polymerase I (KF). The coloring 
scheme indicating subdomains and modules 
is as follows: N-terminal domain (orange), 
thumb (green), paJm (red), palm insertion 
(yellow), fingers (dark blue), specificity loop 
or 'pinky' (black), extended foot (cyan) and 
lysozyme (magenta). 





Current Opinion in Structural Biology 



p66-p5i hctcrodimer partially overlaps with the binding 
surface for duplex DNA [40]. The RNA inhibitor has more 
contacts with the p51 subunit than the duplex DNA docs 
and it may possibly overlap the binding site for tRNA Lys3. 

DNA-dependent RNA polymerases 

Bacteriophage T7 RNA polymerase (T7 RNAP) is a 98 kDa 
monomeric enzyme that catalyzes RNA transcription in a 
promoter-specific manner. A low resolution apoenzyme 
structure was reported by Sousa etai in 1993 [43]. The out- 
line of the T7 RNAP structure is irregular, but it displays 
typical features that are characteristic of the pol I family of 
polymerases. The thumb, palm and fingers subdomains are 
augmented by a long extended specificity loop ('pinky*), a 
palm insertion and an extended foot subdomain. Jeruzalmi 
and Steitz [5*] have recently reported the 2.8 A crystal struc- 
ture of the T7 RNAP-T7 lysozyme complex (Figure 4). In 
vwo, this complex controls gene expression during T7 
phage infection of £. coll A large body of biochemical and 
genetic data on promoter-DNA interactions [44] and on T7 
lysozyme inhibition [45,46] has allowed detailed structure 
and function analyses. The new T7 RNAP crystal structure 
has a similar overall shape to that of the apoenzyme previ- 
ously presented by Sousa a at. ([43], PDB code 2rnp); 
however, the connectivity in the N-termina! domain and the 
secondary-structure assignments throughout the poly- 
merase have been revised and redefined. 

The N-terminal 325 residues of T7 RNAP have no counter- 
part in other polymerase structures. This domain forms the 
front wall of the polymerase cleft and appears to be involved 
m promoter binding. Mutations in this area are detrimental 
to processive RNA synthesis. In contrast to the DNA pol I 
subfamily, the tip of the T7 RNAP thumb does not contain 
any strictly conserved residues. It also differs in shape from 
those found in other polymerases, indicating that the thumb 
subdomain may play a different role in substrate binding. 
Hrocess.viry may be controlled by other mechanisms in T7 
KNAP. The fingers subdomain. located at the opposite side 



of the active site cleft, has a considerably higher degree of 
sequence conservation (J Jager, unpublished data) and a 
curved shape that is compatible with the duplex template. 
Jeruzalmi and Steitz [5*] present clear evidence that the 
organization of the thumb subdomain in T7 RNAP is remi- 
niscent of the corresponding domain in KF and T7 DNAP. 
Therefore, the incoming duplex may pass the fingers in T7 
RNAP and then the template strand may kink sharply in 
order to enter the polymerase active site. Confirmation of 
this, however, requires further studies of large DNA opera- 
tor-polymerase complexes and the results of such co-crystal 
structures are eagerly awaited. 

RNA-dependent RNA polymerases 

Poliovirus 3D RNA polymerase (3DpoI) is the only 
RNA-dependent RNA polymerase whose three-dimen- 
sional structure has been determined [6*]. This class of 
polymerases is responsible for replicating and transcrib- 
ing the genomes of nonretroviral RNA viruses. Even 
though the sequence similarity to other polymerase 
classes is very limited, the 3Dpol structure has the char- 
acteristic fingers, palm and thumb subdomains found in 
all polymerases, clearly indicating that the RNA-depen- 
dent RNA polymerases are evolutionarily related to the 
RTs and the DNA-dependent DNA and RNA poly- 
merases. ^In particular, the core structure of the palm 
subdomain has the same topology as in the other poly- 
merases and is composed of the four sequence motifs. 
A-D [27], that are common to all the polymerase fami- 
lies. The strictly conserved aspartate in motif A and the 
first of the two adjacent aspartates in the YGDD 
sequence of motif C are structurally equivalent to the 
two aspartates conserved in the other polymerase classes. 
Mutations of these aspartates abolish 3Dpol activity and 
mutation of the second aspartate in motif C alters the 
metal specificity of the enzyme [47]. 

The thumb and fingers subdomains of 3Dpol have little 
structural similarity to those in the other polymerases. 
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Figure 5 
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Stereo drawing of intermolecular contacts 
between polio virus RNA polymerase 
molecules. Adjacent polymerase molecules 
(alternately shown In light and dark gray) pack 
in a head-to-tail manner, forming a fiber of 
polymerase molecules aiong a twofold screw 
axis in the crystal lattice. The interface 
between the front of the thumb of one 
molecule and the back of the palm of the next 
buries a surface area of 1 480 A 2 and involves 
at least 23 amino acid sidechains in a variety 
of hydrophobic, ionic and hydrogen bond 
interactions [7*]. Subdomains of the 
polymerase are identified as fingers (F) t palm 
(P) and thumb (T). 



although they presumably play similar roles in primer, 
template and NTP binding. The thumb subdomain is 
largely composed of sequences from the C terminus of 
the protein and is predominantly a helical. Helix K is 
positioned similarly to helix H in-HIV-1 RT, which binds 
in the minor groove of the primer-template duplex (40]. 
Sequence motif E, common only to the RNA-dependent 
polymerases, is located between the palm and thumb 
subdomains. This motif forms a p-turn-P structure that 
was termed the 'primer grip* in HIV-1 RT [40] and that 
interacts with a P strand located at the beginning of the 
thumb subdomain in both 3Dpol and RT. Interestingly, 
this strand in the thumb of the poliovirus enzyme con- 
tains a valine residue that when mutated to isoleucine 
(V391I) specifically disrupts the interaction between the 
polymerase and the viral protein 3AB, both in the yeast 
two-hybrid system and in vitro [48]. 3AB is the precursor 
to the 22 amino acid protein VPg, which primes RNA 
synthesis by 3Dpol [49]. The fingers of 3DpoI arc com- 
posed of sequences located N terminal to motif A and 
between motifs A and B. Only the lower portion of the 
fingers is ordered in the structure. Ribonucleotides cross- 
link to lysine residues in two peptide segments of 3Dpol 
(residues 266-286 and residues 57-74) [50] that are locat- 
ed adjacent to the ordered portions of the fingers. 



A detailed understanding of substrate binding by 3Dpol 
awaits future structure determinations. 

The poliovirus polymerase crystal lattice reveals an 
unusual higher order structure that is believed to be 
important for polymerase activity. 3DpoI is known to 
bind and extend primer-template RNAs in a highly 
cooperative manner with respect to polymerase concen- 
tration [51], suggesting that the polymerase is active as 
an oligomer. Polymerase-polymerase interactions have 
been observed physically, by chemical cross-linking in 
solution [51], and genetically, in the yeast two-hybrid 
interaction assay [48], In the crystals of 3DpoI, one inter- 
action between polymerase molecules is an extensive, 
interface between the front of the thumb of one mole- 
cule and the back of the palm of an adjacent molecule. 
This interaction produces a fiber of head-to-tail poly- 
merase molecules (Figure 5) along a twofold screw axis 
in the crystal lattice and may correlate with the coopera- 
tive RNA-binding activity of 3Dpol. The binding site 
size for a single 3Dpol molecule on single-stranded RNA [ 
is approximately 10 nucleotidesJ52], a length that would |t 
be sufficient to span the 46 A distance between two ; 
active sites along the polymerase fiber The poly- : 
merase-polymerase interactions and the cooperative .t 



This Document has been provided by Technical Information Service of Purdue University and is copyright protected. 



Polymerases and their substrate complexes Jager and Pata 27 



RNA binding activity of poliovirus 3Dpol may not be 
unique. DNA pol P cooperatively binds substrate DNA, 
nn activity that may serve to localize pol P to sites that 
require DNA repair [53-55]. 

Conclusions 

, The large database of polymerase structures currently 
available shows that all the polymerase domains have a 
common overall shape that resembles a right hand. As 
discussed above, the thumb, fingers and palm subdo- 
mains play similar functional roles, despite having 
minimal similarities in sequence and architecture. The 
similarities lead to a generalized image of polymerases, 
in which the primer-template duplex is bound by the 
thumb and positioned such that the 3' end of the primer 
is located at the active site in the palm subdomain and 
the 5' end of the template strand continues across the 
fingers subdomain. The incoming nucleotide is bound 
by both the fingers and palm subdomains, presumably 
entering through the cleft between the fingers and 
thumb, near the 3' end of the primer. Even though the 
duplex does not pass through the cleft between the fin- 
gers and thumb, it is partially encircled by the thumb, 
with helices at the base of the thumb tracking the minor 
groove of the growing duplex and the tip of the thumb 
bending across the top of the duplex. Finally, the most 
universal feature of the polymerases is the two-metal-ion 
catalytic mechanism, being conserved throughout all 
polymerase classes. 

The next few years promise to be exciting, as structural 
and functional polymerase studies continue to converge. 
It will be interesting to see how the structural concepts 
developed thus far apply to the processive multisubunit 
enzymes. Zhang and Darst [56] have taken one of the first 
steps in this direction, with the structure determination of 
the a subunit of £. coll RNA polymerase. Structural stud- 
ies of initiation complexes will be another important step 
in understanding polymerase activity. The elucidation of 
these structures and mechanisms will be eagerly awaited. 

Note added in proof 

The recently published structure of HIV-1 RT complexed 
with dNTP and a covalently tethered primer-template 
provides a detailed view of the RT polymerase active site 
[58]. As seen in the bacteriophage T7 DNAP-DNA com- 
plex [l*], che fingers subdomain tightens around the 
substrates and is transversed by the template strand. This 
new structure is an important step towards understanding 
nucleoside drug resistance. 
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DNA Polymerases 

Abstract 

To date the three-dimensional structures of 8 different nucleic acid polymerases or their subunits are 
known. Similar structures are found for all the domain which mediate the polymerase or 
transphosphorylation activity. All this domains have a hand-like structure with a palm, fingers and thumb 
subdomain. Three carboxylic side chains, which are always found in the same spatial arrangement, and two 
magnesium ions are directly involved in the transphosphorylation reaction. Two aspartatic acids are 
oriented nearly parallel to each other and coordinate two magnesium ions. The magnesium ions are also in 
close contact to phosphate and hydroxy groups of the primer and the incoming mononucleotide. One 
magnesium is thought to stabilize the transition state of the transphosphorylation. This two metal-ion 
mechanism is unique to polymerase. High processivity is mediated to the core of a polymerase by a 
ring-shaped protein complex. This processivity factor act as DNA clamps. The processivity factors are 
highly symmetrical and consist of two or three identical monomers. 
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1. Introduction 

Nucleic acid polymerases are enzymes found in all living cell. DNA polymerases are used the replicating of 
DNA, while RNA polymerases do the transcription of DNA into RNA. Further on, some DNA 
polymerases are able to repair DNA. All the polymerases involved in replication, transcription and repair of 
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DNA are template-directed. They need a single strand of DNA as a template to synthesize a second strand 
of nucleic acid which is complementary in sequence to the template strand. DNA polymerases add 
deoxynucleotides to the growing strand, while RNA polymerases add ribonucleotides. Only some special 
polymerases like the telomerases do not need a template to synthesize new DNA. 

In general, nucleic acid polymerases are catalysing the formation of a bond between a polynucleotide chain, 
which is called the primer, and a mononucleotide. The template strand is a cofactor for this reaction and 
also two magnesium ions as will be further discussed bellow. More precisely, the polymerisation reaction 
is a transphosphorylation. This is illustrated in Figure 1 . The educts are a primer with a free 3 f hydroxy 
group and a mononucleoside-triphosphate. The 3' oxygen makes a nucleophilic attack on the aplha 
phosphate (the innermost phosphate) of the mononucleotide and the beta and gamma phosphates are 
released as pyrophosphate. (So a nucleotide-monophosphate is transphosphorylated from a pyrophosphate 
to the 3' oxygen of the primer.) The transphosphorylation is energetically unfavourable, but the reaction is 
driven forward by the immediate hydrolysation of the pyrophosphate into two anorganic phosphates. This 
is catalysed by a pyrophosphatase. 

Figure 1 also illustrates that all template-dependend nucleic acid polymerases read the template strand in 3' 
to 5' direction and add mononucleotides with their 5' phosphate to the 3' end of the growing polynucleotide 
chain. Therefore the reaction is also called a 5-3 1 polymerisation. RNA polymerases are able to use a single 
mononucleotide as the primer, whereas DNA polymerases always need an oligonucleotide. Accordingly 
only RNA polymerases are able to synthesize a polynucleotide de novo. But DNA polymerases are able to 
add deoxy-mononucleotides either to a DNA or a RNA primer. Consequently DNA polymerases use RNA 
primers to initialize new DNA strands. 

According to their templates and their products, four types of template-dependend polymerases are known: 

DNA-dependent DNA polymerase 

are employed in all cells for replication and repair of the DNA. 

RNA-dependent DNA polymerase 

are found with retro viruses which use to copy RNA into DNA. This type of DNA polymerase are 
called reverse transcriptase (RT). 

DNA-dependent RNA polymerase 

are needed in all cells for the transcription of DNA. Primases are also RNA polymerases and they are 
responsible for the synthesis of the RNA primers needed for the replication of the DNA. 

RNA-dependent RNA polymerase 

are found with such RNA viruses which replicate by copying their RNA directly into new RNA. 



Before a cell is ready to devide into two daughter cells, the genom has to be duplicated. The replication of 
the DNA double helix is a complex process and several proteins are involved. Replication starts with the 
unwinding of the double helix and the formation of a replication fork. While the replication fork moves 
along the parent DNA, one new strand (the leading strand) is synthesized continously. The other strand (the 
lagging strand) is synthesized discountinously. As DNA polymerase read their template strands only in 3' to 
5' direction, one template strand has to be read against the direction of movement of the replication fork. 
This strand is the template for the lagging strand. The lagging strand is synthesized within fragments which 
are called Okazaki fragments and are 1000 to 2000 bases in length. 
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The synthesis of each Okasaki fragment requires a primer. In E. coli the primers are RNA oligonucleotides 
which are synthesized by a special RNA polymerase called DNA primase. Then the primers are extended 
by such DNA polymerases which have a high processivity and a high chain elongation rate. The RNA 
primers of the Okazaki fragments are replaced with DNA by another type of DNA polymerases, which 
have an additional 5-3* exonuclease activity to cut of the primer. Finally a DNA ligase joins the Okazaki 
fragments covalently. 

Thus there are several different polymerases involved in the replication of DNA in vivo. They do not only 
differ in their template and substrate specificity, but also in their processivity, their fidelity and their chain 
elongation rates. The processivity is the number of nucleotides a polymerase add to the growing strand 
before the polymerase dissociates from the template. The fidelity is a measure of the accuracy and 
describes how often a wrong nucleotide is incorporated. To keep the genetic information for the next 
generation a very high fidelity is required for the DNA replication. This is achieved by a proof-reading 
activity present with some DNA polymerases. The proof-reading activity is a 3 -5' exonuclease activity 
which removes mismatched nucleotides. 

Most organisms have several different DNA polymerases. E. coli for example has three different DNA 
polymerases called polymerase I (Pol I), polymerase II (Pol II) and polymerase m (Pol HI). Pol m has a 
high processivity and is the functional enzyme of DNA replication. Pol I has a low processivity and is 
responsible for replacing the RNA primers, which were used to initialize the new DNA strands, with DNA 
and for DNA repair. Less is known about Pol H Five different polymerases are known for eukariotic cells, 
named alpha, beta, gamma, delta and epsilon. Presumably polymerase alpha, delta and epsilon are involved 
in the replication of the nuclear DNA, whereas polymerase gamma is located in the mitochondria. 
Polymerase beta has no proof-reading activity and a very low processivity and is thought to act in DNA 
repair. 
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2. The known 3-dimensional structures of polymerases: An 
overview 

As outlined in the introduction cells are using several different polymerases for the tasks of DNA 
replication, transcription and repair. All this enzymes have been characterized biochemically and their 
genes are known. But less is know about their 3-dimensional structure. All known structures are deposited 
in the Brookhaven Protein Data Bank (PDB). To get a survey of the know structures, the PDB was 
searched (PDB Browse) for all polymerases and transcriptases. Currently (october 1996) the PDB contains 
solved crystall structures of 8 different polymerases or their subunits. 30 entries were found in total because 
more than one structure was solved for some polymerases. All the entries found are listed in Table 1 . 

5 of the 8 structures include the polymerase active site and the structure of this 5 proteins are described in 
more detail in this section. The other 3 structures are a 3'-5 ! exonuclease domain and two processivity 
factors. The structure of the processivity factors is further discussed in Section 4 . 

The first 3-dimensional structure of a nucleic acid polymerase to be solved was the structure of the Klenow 
fragment of Pol I of E. coli (Ollis etal. 1987 aY Pol I is a monomer with 928 amino acids and has three 
enzymatic activities: 5'-3' DNA polymerase, 3-5* exonuclease (proof-reading) and 5-3' exonuclease. The 
5-3' exonuclease acts in digesting RNA primers and in creating gaps during DNA repair which are refilled 
by the polymerase activity. Proteolytic cleavage of Pol I yields two fragments. The larger C-terminal 
fragment (Res 327-928) is called Klenow fragment and is a functional enzyme with 5-3' polymerase and 
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3 f -5 f exonuclease (proof-reading) activity. 

The second structure solved was reverse transcriptase (RT) from human immuno-deficiency virus (HIV) 
(Kohlstaedt et al. 1992) . RT is a heterodimer of the two proteins P66 (66 kD) and P51 (51 kD). Both 
proteins are encoded by the same gene. After translation a 14 kD C-terminal domain is cleaved from P66 
which yields P51. The C-terminal domain of P66 comprises a Ribonuclease H activity (5'-3 ! exonuclease). 
P66 also comprises the active polymerase site, whereas P51 has no enzymatic activity. This is surprising 
since both proteins have the same sequence. But the crystall structure reveals, that they fold into different 
spatial arrangements, although they have the same sequence and the same secondary structure elements. 
The tertiary structure of P51 is enzymatically inactive. 

The overall shape of P66 is strikingly similar to Klenow fragment (KF). Both proteins are resembling a 
hand which grips around a rod. Consequently the subdomains were named according to the anatomy of a 
hand. The palm domain is located at the bottom of the cleft which is flanked by the fingers on one side and 
the thumb on the other. Later on this overall hand-like shape was also found for two other polymerases: 
bacteriophage T7 RNA polymerase (Sousaet al 1993) and rat polymerase beta fPelletier et al. 1994) . The 
structure of the reverse transcriptase of Moloney murine leukemia virus (MMLV) is the fifth known 
structure of a polymerase (Georgiadis et al. 1995) . But only a fragment of the enzyme was crystallized, and 
this fragment contains only the fingers and a palm domains, while the thumb is missing. 

The hand-like shape common to all polymerases is illustrated in Figure 2 for the four polymerases which 
have a complete hand. Although the hand-like shape is found with all polymerases their 
sequence-similarity and their similarity in secondary structure is low. It is still an open question, whether 
the hand-like shape is a result of coevolution or a remaining of a common ancestor enzyme. Sousa(1996) 
argues for the existance of a superfamily of all polymerases. But the low sequence similarity and the great 
differences in the topology of the secondary structure elements found for the hand-like domain are more 
indicative of a homologous evolution, where the hand-like shape is a functional requirement. Little is 
known about the mechanical details of the translocation of the polymerases along the template. But it is 
obvious that the translocation must be precisely coupled to the transphosphorylation reaction and requieres 
a conformational change of the protein. Maybe the organisation in distinct palm, fingers and thumb 
domains, which are flexible against each other, is functionally related to the mechanism of translocation. 

Another feature found with all polymerases is the location of three acidic amino acids at the polymerase 
active site. They are listed in Table 2 for the five polymerases with known structure. The location and 
arrangement of the three side chains can be seen from the RasMol views included in Figure 2 . The three 
carboxylates are directly involved in the transphosphorylation reaction as described in detail in Section 3 . 
The active site with the three carboxylic side chains is located at the bottom of the cleft between the fingers 
and thumb domains and on the surface of the palm domain. Again the question remains open whether the 
location of the three carboxylic side chains is a result of analogous or homologous evolution. 



3. The mechanism of transphosphorylation 

The structure of rat polymerase beta f Pelletier et al. 1994 ) allows the best insight into the mechanism of the 
transphosphorylation reaction. Pol beta is a very simple polymerase with only a 5-3* polymerase activity, 
but no exonuclease or proof-reading activity. The enzyme is a monomer of 335 residues and was 
crystallized in complex with a template DNA strand, a primer DNA strand and a monomeric 
2\3'-dideoxycytisine-triphosphate (ddCTP). In addition the 3' end of the primer was terminated by a 
Z'^'-dideoxycytosine. The missing 3 f hydroxy group of the primer does not allow a transphosphorylation 
and freezes the complex in the initial state of the reaction. 



4 of 6 



5/1/2002 7:30 PM 



DNA Polymerases http://www.imbjena.de^ 




The arrangement at the active site is shown in Figure 3 . The primer as well as the incoming nucleotide are 
forming classical Watson-Crick hydrogen bonds with the template. The cytosine base of the incoming 
ddCTP stacks on the 3' cytosine of the primer. The 3 f hydroxyl group of the primer is not present in the 
crystal structure, but was add in order to illustrate the reaction mechanism. The 03' of the primer 
nucleophilically attacks the alpha phosphorous of the mononucleotide. This leads to a transition state where 
the alpha phosphorous is penta-coordinated to five oxygens. Initially the structure of the alpha phosphate is 
tetrahedral, but changes to planar-bipyramidal in the transition state. The transition state is stabilized by the 
left magnesium ion in Figure 3, whereas the right magnesium stabilizes the leaving pyrophosphate and also 
helps in positioning the incoming mononucleotide. The three aspartatic acid residues Asp 190, Asp 192 
and Asp 256 are crucial for placing the two magnesium ions, but they are not involved directly in catalysis. 



4. DNA clamps mediate high processivity 

The intact holoenzyme of E. coli Pol HI is a large assembly of 10 different protein subunits (alpha, beta, 
gamma, delta, delta 1 , epsilon, theta, tau, chi, psi). The polymerase activity is located on subunit alpha, and 
the 3-5* exonuclease activity (proof-reading) on subunit epsilon. An assemly of alpha, epsilon and theta is 
called the core polymerase complex and can perform DNA polymerisation, but with a low processivity. 
The complex dissoziates from the template after adding a few nucleotides to the growing chain. The 
complex gamma, consisting of the gamma, delta, delta 1 , chi and psi subunit, catalysizes to linkeage of two 
beta subunits to DNA and to the core complex. After the beta subunit is added to the core complex, the 
high processivity of the holoenzyme is recovered. Thus the beta subunit has no catalytic activity by itself, 
but mediates the high processivity to the core complex. 

The crystall structure of the beta subunit was solved by Konget al. 1992 and deposited in the PDB (2pol ). 
The protein is a dimer of two identical subunits. Each beta subunit has 366 amino acid residues. The shape 
of the dimer is shown in Figure 4 . The two monomers stick together head by tail and form a closed ring 
which wraps around a central pore. The ring has an overall diameter of 80 A, and the diameter of the pore 
is approximately 35 A. In the model shown in Figure 4 a DNA double helix is added to the experimentally 
derived protein structure to illustrate the presumable function of the beta subunit. The two monomers 
clamp around the DNA and, when linked to the core complex, they prevent the dissoziation of the 
polymerase/DNA complex. 

The tertiary structure of the beta subunits is highly symmetrical (Figure 5) and therefore quite beautiful. 48 
beta sheets are forming the outer surface, while 12 alpha helices are lining the inner surface. Each 
monomer consists of three domains and all of the three domains within one protein have the same 
secondary structure and nearly the same spacial arrangement. This is amazing because the amino acid 
sequences of the three domains are quite different. The structure of one such domain and its orientation 
towards the DNA is shown in Figure 6 . The domain consists of 8 beta sheets and 2 alpha helices. Either 
four beta sheets are arranged anti-parallel in a Greek Key motif. The domain is twofold symmetrical by 
itself. The sheets and helices are enclosing a core of closely packed side chains. This core stabilizes the 
hole domain. By assembling 6 such building blocks a rigid ring is constructed. The beta subunit does not 
recognize certain DNA sequences. The pore of the ring is wide enough to avoid contacts between protein 
side chains and DNA bases. Only the sugar-phosphate backbone is recognized by the beta subunit. The task 
of the beta subunit is to duct the DNA to or from the core of the polymerase complex regardless of the 
DNA sequence. 

Eukariotic cells have a processivity factor very similar to the beta subunit of E. coli. The protein is called 
proliferating cell nuclear antigene (PCNA). The tertiary structure Figure 7 is nearly identical to the beta 
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subunit, it also forms a ring enclosing the DNA helix. But with PCNA the ring consists of three rather than 
two monomers. The three monomers are identical. The function is the same for PCNA and the beta 
subunit. Both are DNA clamps mediating high processivity. 
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DNA Polymerases 



1. Transphosphorylation 



5 ApGpTpCpGpGpTpApC 3 ' _ 

s'ApTpG 5, + ° PPP 



Polymerase 



5 ApGpTpCpGpGpTpApC 3 ' 

3 'CpApTpG 5 ' 



PP 



2. Hydrolysis of pyrophosphate 



Pyrophosphatase 
pp + H 2 0 > p + p 



Figure!. 



The polymerisation reaction catalysed by DNA and RNA polymerases is a transphosphorylation. A 
mononucleoside-triphosphate is added to the 3' OH group of the growing chain by forming an phospho- 
ester bond between the alpha phosphate of the mononucleotide and the free 3' OH of the growing strand. 
So the alpha phosphate is incorporated in the backbone of the new polynucleotide chain, while the beta 
and gamma phophates are released as pyrophosphate. The transphosphorylation is energetically 
unfavourable, but the reaction is driven forward by the immediate hydrolysation of the pyrophosphate 
into two anorganic phosphates by a pyrophosphatase. 
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Table 1. 

Currently (october 1996) crystal structures of subunits or holoenzymes of 8 different proteins related to 
polynucleotide polymerases have been solved and deposited in the Brookhaven Protein Data Bank 
(PDB): 

1. 

DNA p ol ymerase beta from Rat 

2. 

Klenow fragment of DNA p ol yme rase I from E. coli 

3. 

RNA polymer ase from Bacteriophage T7 

4. 

Reverse transcriptase from HIV-1 

5. 

Catalytic fragment of reverse transcriptase from MMLV 

6. 

3'-5' Exonuclease domain of DNA polymerase from Bacteriophage T4 

7. 

Beta subunit (processivity factor) of DNA polymerase III from E. coli 

8. 

P CNA (p rocessivity factor) of eukaryotic DNA polymerases from Yeast 



1. DNA polymerase beta (Rat) 



Protein 


Ligand 


Resolution 


Reference 


PDB 

code 


31 KD catalytic 
domain (Res 91-335) 




2.3 A 


S away a et al. 
1994 


lbpb 


31 KD catalytic 
domain (Res 95-335) 




2.3 A 


Davies et al. 
1994 


lrpl 


31 KD catalytic 
domain (Res 91-335) 


Mn 2+ 


2.8 A 


Sawava et al. 
1994 


2bpc 


39 KD (Res 9-335) 


P0 4 


3.6 A 


Sawava et al. 
1994 


lbpd 


39 KD (Res 12-335) 


dATP 


3.9 A 


Sawaya et al. 


lbpe 
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1994 




39 KD (Res 9-335) 


ddCTP, template DNA, 
primer DNA, 2 Mg 2+ 


2.9 A 


Pelletier et al. 
1994 


2bpf 


39 KD (Res 9-335) 


ddCTP, template DNA, 
primer DNA 


3.6 A 


Pelletier et al. 
1994 


2bpg 



2. Klenow fragment of DNA polymerase I (E. coli) 



Protein 


Ligand 


Resolution 


Reference 


PDB code 


Klenow fragment 
(Res 326-928) 


Zn 2+ 


2.8 A 


Ollis et al. 
1985 a 


1dpi (C alpha 
coordinates only) 


Klenow fragment 
(Res 326-928) 


dCTP 


3.9 A 


Beese et al. 
1993 b 


lkfd 


Klenow fragment 
mutante (Res 326-928) 


DNA (Res 2- 
29), Zn 2+ 


3.2 A 


Beese et al. 
1994 a 


lkln 



3. RNA polymerase (Bacteriophage T7) 



Protein 


Ligand 


Resolution 


Reference 


PDB code 


Res 1-883 


Zn 2+ , Mn 2+ 


3.3 A 


Sousa et al. 1993 


2rnp (C alpha coordinates only) 



4. Reverse transcriptase (HIV-1) 



Protein 


Ligand 


Resolution 


Reference 


PDB code 


fingers and palm 
(Res 1-216) 


Zn 2+ 


2.2 A 


Ungeetal. 1994 


lhar 


ribonuclease H 
domain (Res 427- 
556) 


Zn 2+ 


2.4 A 


Davies et al. 
1991 


lhrh 


ribonuclease H 






Chattopadhyay 
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domain (Res 427- 
556) 


Zn 2+ 


2.8 A 


et al. 1993 


lrdh 


P66 (Res 1-556), 
P51 (Res 2-427) 


DNA template, 
DNA primer 


3.0 A 


Jacobo-Molina 
etal. 1993 


Ihmi (C alpha 

coordinates 

onlv) 


P66 (Res 1-554), 
P51 fRes 5-427^ 


Mg 2+ 


3.2 A 


Rodgers et al. 
1994 


lhmv 


P66 (Res 1-558), 
P51 fRe<? 1-427^ 


alpha-APA 


2.8 A 


Ding et al. 
1995 a 


lhni 


P66 (Res 1-558), 


TIBO 


3.0 A 


Ding et al. 
1995 b 


lhnv 


P66 (Res 2-556), 


Neveripin 


2.9 A 


Wane etal. 1994 


3hvt 


P66 (Res 1-543), 




2.4 A 


Esnouf et al. 
1995 


m 


P66 (Res 1-543), 


Benzodiazepinone 


2.2 A 


Ren etal. 1995 a 


lrth 


P66 (Res 1-543), 


Phenyl-thiothymine 


3.0 A 


Ren et al. 1995 a 


irti 


P66 (Res 4-539), 
P51 (Res 5-428) 


Nevirapine, Mg 


2.2 A 


Renet al. 1995 a 


lvrt 


P66 (Res 3-539), 
P51 (Res 4-428) 


alpha-APA 


2.4 A 


Ren etal. 1995 a 


lvru 



5. Catalytic fragment of reverse transcriptase (MMLV) 



Protein 


Ligand 


Resolution 


Reference 


PDB code 


catalytic fragment (Res 24-274) 




1.8 A 


Georeiadis et al. 1995 


lmml 



6. 3 f -5' Exonuclease domain of DNA polymerase (Bacteriophage T4) 
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r IUIC1I1 


T icthtiH 


AVwOUiLlllUll 




PDB code 


Res 2-370 


Zn 2+ , Mn 2+ 


2.2 A (298 K) 


Wang et al. 1996 


lnoy 


Res 2-370 




2.2 A (110 K) 


Wang etal. 1996 


lnoz 



7. Beta subunit (processivity factor) of DNA polymerase III (E. coli) 



Protein 


Ligand 


Resolution 


Reference 


PDB 

code 


2 monomers of identical sequence 
(Res 1-366) 




2.5 A 


Kong et al. 
1992 


2p_ol 



8. PCNA (processivity factor) of eukaryotic DNA polymerases (Y east) 



Protein 


Ligand 


Resolution 


Reference 


PDB code 


1 monomer (Res 1-258) 


2Hg 2+ 


2.3 A (synchrotron) 


Krishna et al. 1994 


Iplq 


1 monomer (Res 1-258) 




3.0 A 


Krishna etal. 1994 


lplr 
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Top View 



Side View 




P66 reverse transcriptase (HIV) 
( RasMol script ) 



Klenow fragment 
( RasMol script ) 



Pol beta (Rat) 
( RasMol script) 



RNA Pol (Bacteriophage T7) 
( RasMol script) 
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Figure 2. 

The domain containing the polymerase active site has a hand-like shape with fingers (blue), palm 
(magenta) and thumb (green). The hand-like shape was found for all polymerases for which the 
structure of the active site was solved. In addition the three acidic side chains, which are found in the 
same spatial arrangement in all polymerases, are shown. They are located at the bottom of the cleft 
(palm domain) and are essential for catalysis. The RasMol scripts allow to have a more detailed look for 
this. The distances between the three carboxyl groups are shown within the RasMol views (reference 
atoms are C gamma for Asp and C delta for Glu). The orientation of the side chains of T7 RNA 
polymerase is tentative since the coordinates of the side chains are not included in the PDB entry. 
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Overview ( RasMol script) Zoom (RasMol script) 




Carbon Oxygen Nitrogen Hydrogen Phosphorous Magnesium 



Figure 3. 

The active site of rat Pol beta (PDB entry 2bpf) . The DNA template, the DNA primer, the incoming 
mononucleotide dCTP, three aspartatic acid side chains and two magnesium ions are shown. The 3' 
hydroxyl group of the primer is not present in the crystal structure, but was add to illustrate the reaction 
mechanism. 

The primer as well as the incoming nucleotide are forming classical Watson-Crick hydrogen bonds to 
the template. The oxygen 03' of the primer nucleophilically attacks the alpha phosphorous of the 
mononucleotide. The transition state of this is stabilized by the left magnesium ion, whereas the right 
magnesium helps to position the incoming mononucleotide and stabilizes the leaving pyrophosphate. 
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Front View ( RasMol script) Side View (RasMol script) 




Figure 4. 

Two beta subunits (red and blue) of E. coli Pol III form a ring which wraps around the DNA. The 
coordinates of the protein were taken from PDB entry 2M (Kong et al. 1992). The DNA is an ideal B- 
type double helix, which was placed hypothetically to pass through the pore of the protein dimer. 



http://ww.imb-jena.de/~slickers/polymerases/figure.beta-total/ 



5/16/2002 



DNA Polymerases: Figure betoiubunit tertiary structure Page 1 of 1 



Peter Slickers ( slickers@imb-jena.de ) 

DNA Polymerases 

Front View ( RasMol script ) 



Figure 5. 

The tertiary structure of the beta subunit of Pol III is highly symmetrical. The ring shape of the protein is 
reflected by the arrangement of the 48 beta sheet and the 12 alpha helices. The alpha helices are lining 
the inner surface, while the beta sheets are forming the outer surface. 
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Front View (RasMol script) Side View ( RasMol script ) 





Figure 6. 

Each beta subunit consists of three domains with identical secondary structure and nearly the same 3- 
dimensional arrangement. One such domain is shown here in a Richardson type schematic diagram. The 
domain consists of two alpha helices and eight beta sheets and is symmetrical by itself. As can be seen 
from the side view either four beta sheets are arranged anti-parallel in a Greek Key motif. 
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Front View ( RasMol script) 




Figure 7. 

The tertiary structure of the eukaryotic processivity factor PCNA (proliferating cell nuclear antigene). 
PCNA (PDB entry lplq ) is a trimer of three identical subunits shown in red, yellow and green. The three 
subunits form a ring-shaped structure. PCNA is a DNA clamp and mediates high processivity to the 
eukaryotic polymerases delta and epsilon. Although PCNA consists of three monomers the secondary 
and tertiary structure are very similar to subunit beta of E. coli Pol III. 
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Structural and mechanistic 

relationships between 
nucleic acid polymerases 

Rui Sousa 

A superfamily of nucleic acid polymerases that includes the pol I and pol a 
classes of DNA-directed DNA polymerases, mitochondrial and phage DNA- 
directed RNA polymerases, and most RNA-directed polymerases may be 
defined on the basis of the occurrence of conserved sequence motifs and 
tertiary structure similarities between HIV-1 reverse transcriptase. DNA 
polymerase I and T7 RNA polymerase. Although sequence or structural 
similarities do not yet justify inclusion of the muiti-subunit DNA-directed 
RNA polymerases in this superfamily, mechanistic similarities suggest a 
deep relationship between these and the simpler T7-like RNA polymerases. 



THE TERTIARY STRUCTURE of a 
template-directed nucleic acid polym- 
erase, that of the Klenow fragment of 
DNA polymerase 1 (DNAP 0. was first 
described in 1985 (Ref. 1). Seven years 
were to pass before another polymerase 
structure appeared in the literature. 
During this period, development of the 
field depended largely on structure- 
function studies and on the identification 
of conserved sequence motifs among 
the increasing number of known polym- 
erase sequences. These efforts culmi- 
nated in an alignment that included 
most DNA-directed DNA polymerases 
(DNAPs), reverse transcriptases (RTs), 
RNA-directed RNA polymerases (RNAPs) 
and DNA-directed RNAPs (Ref. 2). How- 
ever, the tenuous nature of many of 
these sequence similarities cast doubt 
on the entire scheme, which therefore 
awaited confirmation and refinement, 
or rejection, based on further structure- 
function or structural studies. The past 
few years have seen the emergence of 
the structures of three new polym- 
erases 3 " 9 , which now make it possible to 
evaluate the significance of these pat- 
terns of apparent sequence conservation. 

Motif conservation 

The pattern of polymerase motif con- 
servation identified by Delarue et al. 2 . 

R. Sousa is at the Department of 
Biochemistry, Floyd Curl Drive. University of 
Texas Health Sciences Center. San Antonio, 
TX, USA. 
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Poch et a/. 10 and Mendez et ai u can be 
seen in Fig. 1. Although more extensive 
patterns of sequence similarity within 
certain polymerase families have been 
identified, we focus here on the limited 
set of motifs that are most widely dis- 
tributed. It can be seen that there is a 
correlation between polymerase tem- 
plate or substrate specificity and motif 
conservation. For example, motifs 
T/DxxGR and B are found only in polym- 
erases that use DNA templates, while 
motifs B' and D are restricted to polym- 
erases that use RNA templates and, 
within the RNA-directed family of 
polymerases, motif E is restricted to 
polymerases that use dNTPs. Motifs A 
and C unify the RNA* and DNA-directed 
RNA or DNA polymerases because they 
occur in polymerases of either template 
or substrate specificity. 

Relating sequence motifs to structure 

It is instructive to examine the over- 
all structures of these enzymes before 
looking at where the sequence motifs 
occur within them (Fig. 2). The similar- 
ity in the shape of the polymerase 
domains of T7 RNAP, p66 HIV-1 RT, and 
DNAP I to a 'cupped right hand* has led 
to the designation of the three sub- 
domains of the polymerase domain as 
'fingers', 'palm 1 , and 'thumb' 4 . The most 
extensive similarity is seen between T7 
RNAP and DNAP I : the folding and al- 
most all of the secondary structure in 
their respective polymerase domains is 
nearly identical, while the structural 

Z 1996. Elsevier Science Ltd 



similarity with HIV-1 RT is limited to 
a core comprising most of the palm 
subdomain. 

Motifs A and C, Peering more deeply 
into the large template-binding clefts 
of these enzymes, we can localize ac- 
tive sites that have been defined by 
structure-function studies, structural 
studies of polymerase-substrate/template 
complexes, and sequence comparison 
(Fig. 3), Most of the residues forming 
these active sites are part of the se- 
quence motifs shown in Fig. 1. Motifs A 
and C form three strands of a p-sheet 
and a short segment of a-helix within 
the core of the palm subdomain, which 
is structurally similar in RI T7 RNAP 
and DNAP I. Two amino acids (Asp537/ 
Asp812 in T7 RNAP; Asp705/Asp882 in 
DNAP I; Aspll0/Aspl85 in HIV-1 RT), 
which are identified as invariant within 
these motifs, are brought into alignment 
when the three polymerases are super- 
imposed. These two Asp residues bind 
and present two metal ions in the ap- 
propriate geometrical arrangement to 
catalyse a phosphoryl transfer reaction 
at the active site 12 . A third well-conserved 
carboxylate (Glu883 in DNAP I; Aspl86 
in RT; absent in T7 RNAP) is also ex- 
pected to be involved in catalytic metal 
binding. Significantly, mutation of this 
third carboxylate reveals that it is less 
critical for activity than either of the 
two invariant Asp residues 13 * 14 . 

Motifs B and B' are located in the fin- 
gers subdomains of DNAP I, T7 RNAP 
and RT, respectively. While motifs B and 
B' are dissimilar in sequence and struc- 
ture, and occur within a subdomain that 
is structurally dissimilar in the RNA- 
directed versus DNA-directed polym- 
erases, they are similarly positioned 
relative to the center of the active site 
in both classes of polymerases. In 
the structure of HIV-1 RT complexed 
with primer-template, the fingers sub- 
domain and elements from motif B' con- 
tact the template strand 5 . Modeling, 
structural and mutation studies imply 
that a region in the corresponding pos- 
ition (including elements of motif B) of 
the fingers subdomain of DNAP 1 or T7 
RNAP would be similarly involved in 
contacts with the template strand 15 . As 
the template in the RT primer-template 
structure does not extend downstream 
of the 3' end of the primer, downstream 
template contacts must be deduced 
from modeling. The more compelling 
model would place the downstream tem- 
plate contacts on p-strands 3 and 4 of 
RT, the loop between these strands and 
(perhaps) the car boxy-terminal region 
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Figure 1 

Patterns of motif conservation in nucleic acid polymerases 2 - 10,11 . Residues in blue are invariant. 
Other residues given are well conserved: h. hydrophobic residue: +. positively charged residue; 
-, any residue; .. a sequence gap. Residue numbers of invariant residues in DNA-directed DNA 
polymerase (DNAP) I. T7 DNA-directed RNA polymerase (RNAP) and HIV-1 reverse transcriptase 
(RT) are: for T/DxxGR motif - DNAP I, Arg668: T7 RNAP. Arg425: for Motif A - DNAP I, Asp705; 
T7 RNAP, Asp537; RT, AspllO: for Motif B - DNAP I, Lys758/Tyr766/Gly767; T7 RNAP, 
Lys63l/Tyr639/Gly640; for Motif B' - RT. Glyl52; for Motif C - DNAP I, Asp882; T7 RNAP, 
Asp812; RT, Aspl85; for Motif D - RT, Lys220; and for Motif E - RT, Gly231. 
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of [3-strand 11a (Ref. 16). However, it is 
also possible that these elements are in- 
volved in substrate contacts 1718 . The 
latter hypothesis would be consistent 
with recent evidence from structural 
and mutational analyses that the amino- 
terminal region of motif B (helix 0) in 
DNAP 1 interacts with the dNTP phos- 
phates and ribose moiety 19 - 20 . The 
docked dNTP modeled by Arnold and 
colleagues in the RT primer-template 
complex would not contact the fingers 
subdomain, but would instead establish 
contact with elements of motifs A and C 
and (possibly) with p-strand 11 (Ref. 16). 
It is, therefore, unclear if, the fingers 
subdomain is involved in substrate as 
well as template strand contacts, or if 
this represents a case where analogous 
structures in different polymerases have 
different functions (i.e. a role in sub- 
strate binding for the fingers of DNAP I 
and T7 RNAP. but not for RT). 

Functional roles of the T/DxxGR motif and 
motif E. Irrespective of the question of 
substrate binding, it is clear that the 
fingers subdomains and elements of 
motifs B and B' are involved in template- 
strand binding in both the DNA-directed 
and RNA-directed polymerases. It is, 
therefore, intriguing that structural 
similarity in the fingers subdomain and 
conservation of motifs B and B' reflect 
polymerase template specificity. In the 
same way. we can examine the location 
and proposed function of the T/DxxGR 
motif, which occurs in the DNA-directed 
polymerases, but not in the RNA- 
directed polymerases. Mutational stud- 
ies and modeling of template-DNAP I or 



template-T7 RNAP structures based on 
the RT primer-template complex reveal 
that the structure formed by this motif 
is also involved in template-strand con- 
tacts 15 . Along similar lines, it may be 
noted that motif E, the only one of five 
motifs conserved in RNA-<lirected DNA 
polymerases that is not also conserved 
in the RNA-directed RNA polymerases, 
forms a structure designated the 'primer 
grip', which is intimately associated 
with the primer strand 5 . 



Structural differences between T7 RNAP and 
DNAP I. It is intriguing that there is one 
region where DNAP I and RT are more 
similar to each other than to T7 RNAP, 
even though DNAP I and T7 RNAP show 
greater structural similarity overall. RT 
and DNAP I both exhibit a fourth 
3-strand ((3-strand 14 in DNAP I; I la, 
lib in RT). which extends the three- 
stranded sheet formed by motifs A and 
C. T7 RNAP lacks this fourth p-strand. 
As DNAP I and RT share substrate (and 



(a) HIV I RT P66 pol domain (b) DNAP I pol domain (c) T7 RNAP 

^ y Extra 1 

C-Term. Extra2 LI \A N-Term 
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Figure 2 

Structures of the polymerase domains of (a) P66 reverse transcriptase (RT). (b) DNA-directed DNA poKmerase .DNAP) I and (c) the complete T7 
DNA-directed RNA polymerase (RNAPi molecule : - 3-5 . The 'thumb' subdomains are colored green, the *oaim* sucdomains are in red, and the 'fin- 
gers' subdomains are blue. Structural elements in T7 RNAP that have no counterpart in the DNAP 1 po^merase domain are colored light gray 
■ 'Extra V. *N*TemV domarn. 'C-Term to Palm') or orange fExtra2'K The single magenta-colored helix in DNAP I and T7 RNAP is not formally con- 
sidered part of the polymerase subdomain. but is conserved between T7 RNAP and DNAP 1. The :wo gree^-colored spheres mark the positions of 
*he tr.;3na'*; Asp residues, .vhicn identify the center of the active site. 
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Figure 3 

Polymerase domain structures of (a) reverse transcriptase (RT), (b) DNA-directed DNA polymerase (DNAP) I with the 'thumb* subdomains re- 
moved to allow an unobstructed view into the active site, and (c) the T7 DNA-directed RNA polymerase (RNAP) structure with the thumb and 
amino-terminal domains removed. The polymerases are presented with the long axis of the template-binding cleft parallel to the long axis of the 
page. RT helices E, F and strands 6, 9 and 10 correspond to DNAP I helices Q, R and strands 9, 12, 13, and to T7 RNAP helices CC, DD and 
strands 5, 8 and 9, respectively. Trie structures formed by motifs A and C are in yellow; motifs B and B' are colored magenta; and the T/DxxGR 
motif {strands 3 and 4 in T7 RNAP; 7 and 8 in DNAP I) is orange. Motif E is colored green. Elements of the 'palm' not part of these motifs are in 
red, and the 'fingers' are blue. The carboxyi terminus of T7 RNAP is marked by an asterisk. T7 RNAP structures not a part of the polymerase 
domain are in light gray or orange. The region of RT in contact with the template strand in the RT-primer-tempIate structure is indicated by 
diagonal white lines 5 . The diagonal yellow lines indicate where additional template-strand contacts might be made if the template strand were 
extended beyond the length in the crystal structure of the complex 16 . The region in contact with the primer strand is in indicated by diagonal blue 
lines 5 . Elements of the RT molecule that could contact a substrate dNTP in the RT active site are indicated by diagonal magenta lines 16 . Green 
triangles indicate the locations of mutations that affect NTP K,/ or nucleoside analog utilization, but whose effects are thought to be mediated 
by effects on template binding 21 . Magenta triangles indicate mutations that affect NTP Km* or nucleoside analog utilization or become 
crosslinked to NTP, and whose effects are interpreted to be owing to direct NTP contacts 13 " 15 * 19-2149 - 50 . Blue circles indicate the positions of 
mutations that affect K d * for template or become crosslinked to the template and whose effects are interpreted to be owing to template-strand 
contacts 13 - 15,51 . Magenta circles indicate mutations that increase template K 6 * and whose effects are interpreted to be owing to primer-strand 
contacts. The green spheres indicate the positions of the catalytic Asp residues, 1^* and ^ effects are fivefold or greater. 



product) specificity, one possibility is 
that this structural divergence between 
the DNAPs and the RNAP is related to 
substrate/product specificity. Such a 
speculation is supported by the pro- 
posal that the AZT-resistance mutations 
at Lys219 of RT p-strand 11a exert their 
effects directly through contact with 
the dNTP (Ref. 21). It is also supported 
by the observation that mutations of 
Phe882 at the carboxyi terminus of T7 
RNAP (which superimposes on Lys219 
of RT) increase rNTPK^ (Ref. 22). % 

Alternatively, this structural pattern 
might be related to the fact that T7 RNAP 
uses a double-stranded template while 
DNAP I and RT use partially single- 
stranded templates. A fourth p-strand 
positioned in T7 RNAP analogously to 
the p-strand seen in DNAP I and RT 
could clash sterically with the domain 
we call 'Extral\ which is present in T7 
RNAP, but not DNAP 1 or RT, and which 
might be involved in unwinding. This 
fourth p-strand could also occlude a 
groove in T7 RNAP in which the un- 
wound non-template strand might bind. 
Truncation of the carboxyi terminus of 
T7 RNAP to remove the fourth p-strand 
might then reflect the need to remove 
structures that would sterically clash 
with the unwound non-template strand 
or with the protein domains respon- 
sible for unwinding. 
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Summary of motif structure-function corre- 
lationaZWithin a superfamily that includes 
the DNA pol I class of enzymes, the 
phage and mitochondrial RNAPs, the 
majority of RNAKlirected polymerases, 
and (perhaps) the po! a class of en- 
zymes, we can catalogue a set of corre- 
lations between conservation of struc- 
tural elements and the functions of 
those elements. Motifs A and C, and 
most of the palm subdomain, are con- 
served irrespective of polymerase- 
template or substrate specificity, reflect- 
ing a direct role for these structures in 
the activity common to all polym- 
erases: phosphodiester bond formation. 
Conservation of motifs T/DxxGR, B and 
the fingers domain in the DNA-directed 
polymerases, and conservation of motif 
B' and a distinct fingers domain in the 
RNA-directed polymerases reflects a 
role for these elements in template- 
strand binding. The unique position of 
Motif E, as the motif that is conserved 
only in the DNA-synthesizing polym- 
erases within the RNA-directed class of 
enzymes, reflects the role of this struc- 
tural element in product (i.e. primer) 
contacts. Finally, structural similarity in 
p-strands 14 and 11 of DNAP 1 and RT, 
respectively, and divergence with T7 
RNAP in the corresponding region 
might reflect a role for this structural 
element in substrate/product contacts 



or the structural requirements for utiliz- 
ation of a double-stranded template. 

Modularity in polymerase architecture 

One way to look at polymerase struc- 
ture is to imagine that polymerases are 
assembled from modules whose struc- 
tural conservation often reflects com- 
mon function, as illustrated in Fig. 4. 
The fingers, palm and thumb sub- 
domains together form the polymerase 
domain. The thumb subdomains, whose 
functions have not yet been addressed, 
are the least well conserved of these 
three subdomains, but appear to be 
similar in certain general features: they 
are extended, flexible, predominately 
a-helical structures that are involved in 
conferring processivity on polymeriz- 
ation by wrapping around bound tem- 
plate and/or interacting directly with 
the template to inhibit polymerase- 
template dissociation 23 * 26 . 

The function of the complete polym- 
erase domain appears limited to the 
minimal function of template-directed 
processive polymerization. Specific pol- 
ymerases can display additional activ- 
ities, but these activities appear to reside 
on distinct domains. Thus, T7 RNAP is 
capable of sequence-specific (promoter) 
DNA binding and template unwinding. It 
emerges that promoter-specific binding 
may be largely conferred on T7 RNAP 
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A modular architecture for polymerases is suggested. The white lines indicate how different structural elements are linked to each other. The 
functions of these structural elements and thekoccurrerx» Indifferent polymerases are tn<r^ 

by the /Extra2V^domaJn (Fig. 4), as 
amino acids critical for promoter recog- 
nition map to this loop 27 - 28 . Template- 
unwinding activity in T7 RNAP has not 
been mapped, but it is possible that the 
Extra2 domain plays a role in this pro- 
cess as it is placed at the leading end of 
the polymerase and forms two grooves 
into which the unwound strands of DNA 
might fit The major accessory domains 
of each of these polymerases (RNAse H 
in RT; Exo proofreading In DNAP I; 
aminotermlnal domain in T7 RNAP) are 
responsible for distinctive activities in 
each enzyme and bear no structural 
similarity to one another, although they 
do occupy roughly equivalent positions 
so that they can interact with nucleic 
acid strands upstream of the polym- 
erase active site. 

The scheme for polymerase organiz- 
ation presented in Fig. 4 suggests that 
gene fusion and recombination events 
might have played a role in the evolu- 
tion of modem multidomain polymerases 



from multi-subunit enzymes composed 
of more simple polypeptides. In this 
light, the structure of the middle RNAP 
of phage N4 is intriguing because it 
shows homology to the complete T7 
RNAP polymerase domain, but is com- 
posed of two subunits (designated P4 
and P7; L Rothman-Denes, pers. com- 
mun.). The break in the N4 RNAP places 
the amino terminal of the palm and 
thumb subdomains onto one subunit, 
and the conserved catalytic core and 
fingers structures on the other. Thus, in 
this Instance, the subunit separation 
coincides with one of the subdomain 
divisions presented in Fig. 4. 

Structure and extent of this polymerase 
superfamily 

Using the alignments of Delarue et 
at. 2 and Poch et at. 10 as a guide, it is ex- 
pected that the pol I class of DNAPs and 
the phage and mitochondrial RNAPs 
will display polymerase domain struc- 
tures with palms and fingers similar to 



those shown in Fig. 4, while the struc- 
tures of the thumb subdomains might 
show variation within the range re- 
vealed in this figure. The alignment by 
Delarue et al included the pol 0 class of 
DNA polymerases in its scheme for 
unification, but also suggested that the 
pol 0 class was exceptionally divergent 
because it was unique among the DNA- 
directed polymerases in lacking an 
identifiable motif B sequence. Analysis 
of the X-ray structure of pol 0 suggests 
that, in fact, this polymerase is not re- 
lated to DNAP I, RT or T7 RNAP and 
might be more closely related to 
nucleotidyl transferases, which are not 
polymerases 29 . The poi a class of DNA 
polymerases is expected to display a 
palm like that of DNAP 1 or T7 RNAP. 
however, changes in the spacing be- 
tween motifs A, B and C imply at least 
some topological rearrangements in the 
fingers domain of the pol a class of 
polymerases relative to the pol 1 class. 
For the RNA-directed polymerases. 
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analysis of the spacing between motifs 
also suggests that the palm domain 
structures will be relatively invariant, 
while there might be variations in the 
fingers-domain structures of the most 
distantly related classes. Alternatively, 
variations in motif spacing could be 
accommodated in 'extra' modules and 
variation in lengths of different elements 
on the periphery of these domains, 
which would leave the folding of their 
cores unchanged (as is seen, for example, 
in the comparison of the T7 RNAP and 
DNAP I structures). 

Status of the muiti-subunit RNAPs 

Not included in the Delarue et ai 
alignment were the muiti-subunit 
RNAPs (prokaryotic RNAPs, eukaryotic 
RNAPs I, II and III). The identified se- 
quence similarities between the cata- 
lytic subunits of these RNAPs and 
polymerases of the RT-DNAP I-T7 
RNAP superfamily 11 - 30 - 31 are not exten- 
sive enough to be conclusive without 
confirmation from structures at atomic 
resolution and, despite observations of 
similarity in the overall shape of the 
muiti-subunit RNAPs and T7 RNAP 32-34 , 
it might be more profitable to compare 
the extensive mechanistic, rather than 
the structural, similarities of these en- 
zymes. These include similarities in the 
timing, triggering and conformational 
changes involved in the transition from 
the poorly processive initiation phase 
of transcription to the elongation 
phase 35 -* 1 , and the utilization of similar 
promoter 42 - 43 and terminator se- 
quences 44 " 47 by both classes of polym- 
erases. The observation that the yeast 
RNAP holoenzyme is a dimer composed 
ofa core enzyme that is homologous to 
T7 RNAP and a polypeptide that is 
functionally and structurally similar to 
the sigma subunit of Escherichia coii 
RNAP 48 , might also be relevant to 
understanding the relationship between 
the single subunit and muiti-subunit 
RNAPs. 

Concluding remarks 

The significance of the observed 
pattern of structural similarities be- 
tween nucleic acid polymerases, the 
correlations between structural similar- 
ity and the functional role of different 
structural elements, and the mecha- 
nistic similarities between the muiti- 
subunit RNAPs and simpler polymerases 
like T7 RNAP have yet to be fully 
explored. Do the observed patterns 
reflect strict functional requirements 
for the utilization of different templates 
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or substrates, or do they reflect contin- 
gent events in the evolutionary history 
of polymerases? To what degree can the 
different functionalities displayed by a 
polymerase like T7 RNAP be distinctly 
mapped to different structural modules? 
Studies aimed at answering these ques- 
tions and others might profit from a 
perspective that integrates the recog- 
nized structure-function relationships 
of nucleic acid polymerases. 
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Abstract: The ubiquitous and essential nature of DNA predicts that enzymes responsible for DNA synthesis evolved 
early and share a common design and mechanism of action. DNA polymerases from many different organisms do exhibit 
striking similarities in their overall architecture, the design of the catalytic site, and the mechanism of nucleotidyl transfer. 
In spite of these shared features, however, DNA polymerases display an astonishing variety in structure and function, 
ranging from single subunit enzymes specialized for DNA repair to complex multi-protein holoenzymes responsible for 
genomic DNA replication. This review uses a few well-characterized model systems to summarize our current 
understanding of the workings of DNA polymerases in DNA metabolism. 

INTRODUCTION 

DNA polymerases catalyze the synthesis of deoxyribonucleic acid in a template-dependent process 
that results in a faithful copy of the original DNA molecule [1]. Consequently, these enzymes are 
necessary to propagate, maintain, and manipulate the genetic code of living organisms. The discovery 
of Polymerase I (Pol I) from E. coli in 1956 [2,3], and the crystal structure of the Pol I Klenow 
fragment in 1985 [4], heralded an explosion in DNA polymerase research, which continually yields 
new and surprising insights into the structure and mechanisms of these enzymes. The basic chemistry 
of nucleotidyl transfer for extending the DNA polymer one nucleotide at a time is universal, and all 
polymerases appear to utilize a similar catalytic mechanism for DNA synthesis [5]. However, an 
extreme diversity of DNA polymerases exists, not just among different organisms but within 
individual organisms. In Saccharomyces cerevisiae, for example, 8 different DNA polymerases had 
been discovered at the last count [6]. These enzymes function alone or with accessory factors, and 
vary substantially in the fidelity, speed, and processivity with which they replicate DNA. DNA 
polymerases are necessary not only for genomic DNA replication, but also for repair of DNA 
damage, DNA recombination, as well as the link between these processes and cell cycle regulation. 
Thus, the diversity in polymerases likely reflects the wide variety of metabolic situations in which 
DNA synthesis is required. 

Based on their functions, DNA polymerases can be broadly classified into two groups: A) 
replicative DNA polymerases (DNA replicases), that are responsible primarily for duplicating 
genomic DNA and B) repair polymerases, that primarily fix damaged DNA (including the new 
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family of polymerases responsible for DNA lesion bypass). Replicative DNA polymerases must 
synthesize extended lengths of DNA with high speed as well as high accuracy to ensure that each 
daughter cell receives a true copy of the genetic code on cell division. In general, these DNA 
replicases are complex macromolecular assemblies of several proteins that function together for 
efficient DNA replication [7,8]. In contrast to DNA replicases, polymerases dedicated to DNA repair 
generally have a simpler architecture and appear designed for DNA synthesis localized to areas of 
DNA damage. For example, replicative DNA polymerases tend to stall or dissociate from the 
template when they encounter lesions caused by ultraviolet radiation or chemical damage to DNA. 
On these occasions, specialized repair polymerases access the site and synthesize short DNA 
fragments to bypass the lesion, following which DNA replicases restart replication [9,10]. The DNA 
lesions are ultimately removed by DNA repair processes such as nucleotide excision repair [1 1], base 
excision repair [12-15], and mismatch repair [16,17] that employ several proteins including both 
repair-dedicated polymerases (e.g., E. coli Pol I in nucleotide excision repair) and replicative DNA 
polymerases (e.g., E. coli Pol III in mismatch repair). 

A more structure-based classification of DNA polymerases groups them into several different 
families based on similarities in amino acid sequence and structure [18,19]. The DNA polymerase I 
or Pol A family includes bacterial and bacteriophage polymerases such as E. coli Pol I, Thermus 
aquaticus DNA polymerase, and the bacteriophage T7 DNA polymerase, among others. These 
enzymes function in DNA replication (e.g., T7 polymerase, [20]), and repair (e.g., Pol I in nucleotide 
excision repair [11]). The polymerase B or Pol a family is more diverse and comprises bacterial 
enzymes (e.g., E. coli Pol II [1]), eukaryotic DNA replicative polymerases such as Pol a, Pol 6, and 
Pol 8 [6], replicative polymerases from bacteriophage T4 and RB69 [21], as well as the archaeal Pol 
B polymerases, among others [22,23]. The E. coliUNA polymerase III (a subunit) shares little 
sequence homology with any other polymerase and is therefore in a class by itself (polymerase C 
family) [24]. Polymerase P (family X), also unrelated to the above polymerases, is an eukaryotic 
enzyme responsible for filling in single nucleotide gap in DNA during base excision repair [12,13]. 
Recently identified archaeal polymerases related to Pol p are also included in family X [22,23]. The 
RT family contains RNA-dependent DNA polymerases including HTV reverse transcriptase and 
telomerases [25]. A new, rapidly growing family of polymerases contains DNA polymerases such as 
the E. coli UmuC [26] and DinB proteins [27], and the S. cerevisiae Pol r| and Pol C, [10], that are 
responsible for lesion bypass [28,29]. Finally, the DP2 family (named after the Pyrococcus Juriosus 
DP2 polymerase [30]) contains newly identified archaeal DNA polymerases that share little 
homology with the enzymes listed above [22,23]. These families are subdivided further according to 
particular similarities and differences in polymerase structure and function, highlighting the 
incredible diversity of these enzymes. In this review we focus on a few well-characterized 
polymerases to summarize our current understanding of the structure and mechanism of action of the 
catalytic polymerase protein as well as DNA polymerase holoenzymes (in which several proteins 
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function together to replicate genomic DNA). 

DNA POLYMERASE STRUCTURE AND FUNCTION 

Polymerases are unusual enzyme catalysts in that they use DNA, a substrate in the reaction, as a 
template to direct formation of the product, which is also DNA. In order to make a true copy of 
DNA, these enzymes are designed to: 1) recognize and bind a primed DNA strand, which serves as 
the template for synthesis, 2) recognize and bind a deoxynucleoside S'-triphosphate (dNTP) and 
match it with the template to form a base pair, 3) catalyze nucleotidyl transfer in which a covalent 
link is formed between the 3 f -hydroxyl group on the primer and the incoming dNTP, and finally 4) 
reposition the newly extended DNA polymer for the next cycle of nucleotidyl transfer [1], DNA 
polymerases can also proofread the newly synthesized DNA with a 3* - 5* exonuclease activity, 

which reduces their inherently low error rate of 1 in 10 5 base pairs to that approaching 1 in 10 8 base 
pairs during DNA replication [31-35]. 

Fig. (1). The shape of a DNA polymerase. The polymerase domain of the is. coli polymerase I Klenow fragment 
represents the typical topology of a DNA polymerase. The fingers, palm, and thumb subdomains are arranged to 
form a deep DNA binding cleft, with the catalytic site in the palm, at the base of the cleft. The proofreading 
exonuclease lies adjacent to the thumb domain and extends south of the polymerase domain in this view. The 
position of the exonuclease domain varies with respect to the polymerase domain in different enzymes (Pol I [4] 
and T7 polymerase [37] versus RB69 [71] and D. tok polymerases [40]). 

The DNA replication reaction presents several challenges for an enzyme catalyst. Predominant 
among these are the requirement that polymerase bind all four dNTPs and yet distinguish the correct 
dNTP from the other three in each catalytic cycle. The polymerase also has to bind DNA with high 
affinity and release it — but without complete dissociation — at the end of each reaction, so that it can 
be repositioned within the active site for the next reaction. Remarkably, DNA polymerases are 
capable of performing this complex task with high speed and fidelity. In the past four decades, the 
structure and kinetic mechanisms of DNA polymerases have been the subject of intense study, in an 
effort to understand how these biological machines work. The first crystal structure of a DNA 
polymerase (Klenow fragment of E. coli Pol I) yielded valuable insights into how polymerases are 
designed to catalyze DNA replication (Fig. 1) [4,19]. Subsequently, crystal structures of several other 
polymerases revealed that despite differences in amino acid sequence and structure among enzymes 
from different families, they share several features that are essential for activity (e.g., Pol I family: 
Klenow fragment [4], Thermus aquaticus pol [36], T7 DNA pol [37]; Bacillus stearothermophilus 
pol [38]; Pol n family: RB69 pol [39], D. tok Pol [40]; family X: pol p [41]; RT family: HIV-RT 
[42]. The shape of the polymerase domain is by far the most prominent feature common to all the 
polymerase structures determined to date. As described first for the Klenow fragment, the polymerase 
resembles a half-open right hand with the "palm" subdomain forming a cleft that is flanked by the 
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"fingers" and "thumb" subdomains. Together, the three subdomains hold the primer-template DNA 
and position the incoming dNTP for incorporation into DNA. The palm subdomain contains the 
catalytic site where the chemistry of nucleotidyl transfer takes place. The fingers subdomain interacts 
with and positions the template DNA strand and the incoming dNTP, while the thumb subdomain 
primarily binds the duplex DNA in a sequence-independent manner along the minor groove. Most 
DNA polymerases also contain a 3 f - 5' exonuclease domain within the same polypeptide, which 
proofreads newly synthesized DNA and corrects mismatched base pairs [5,43,44]. 

DNA Polymerase Mechanism 

Initial research on DNA polymerase mechanisms was also focused on the Klenow fragment (an 
enzyme specialized for DNA repair), and the use of rapid kinetic techniques revealed individual steps 
in the polymerase reaction pathway [45-48]. Further analyses of such enzymes as the T7 DNA 
polymerase [34], HIV reverse transcriptase [49], and the T4 polymerase [50], yielded the detailed 
mechanism by which replicative DNA polymerases synthesize DNA. Fig. 2 shows the polymerase 
reaction pathway with kinetic and thermodynamic parameters that define the catalytic activity of the 
T7 DNA polymerase. The core elements of this reaction mechanism generalize to the other DNA 
polymerases, although the equilibrium constants and rates of reaction vary [34]. 

Fi g. (2). Kinetic mechanism of DNA polymerase activity. The pathway of DNA synthesis catalyzed by DNA 
polymerases, with kinetic and thermodynamic parameters defining the bacteriophage T7 polymerase 
mechanism. Both DNA and the correct dNTP bind polymerase rapidly and with high affinity, following which 
the polymerase undergoes a slow, rate-limiting conformational change (Pol_DNA__dNTP to Pol*_DNA _dNTP) 
in preparation for nucleotidyl transfer. The chemistry step is fast, and is followed by another conformational 
change in the polymerase, which leads to product release and translocation of DNA for the next catalytic cycle. 
Figure adapted from [51]. 

The polymerase first binds a primed DNA template with high affinity (K d ~ 20 nM), and then binds 
a dNTP molecule to form a ternary Pol_DNA_dNTP complex [51]. The dissociation constant for 
interaction between polymerase and the correct nucleotide is 20 \iM versus 4 to 8 mM for nucleotides 
that do not match the DNA template [51,52]. Such discrimination against incorrect nucleotides 
suggests that the initial dNTP-binding step involves base-pair formation between the nucleotide and 
template DNA, during which the polymerase can accept a good match or reject a mismatch. Similar 
selectivity in dNTP binding has been observed in the case of HIV reverse transcriptase [49] and the 
T4 DNA polymerase [53]. Next, there is a slow, rate-limiting step in the reaction, prior to the 
chemical step of nucleotidyl transfer, which results in formation of the Pol*_DNA_dNTP complex. 
The rate of dissociation of DNA from this complex (and an analogous HIV reverse transcriptase 
complex) is 10-fold slower than from its precursor, the Pol_DNA_dNTP complex (Fig. 2) [34,49]. 
These data predicted that the polymerase changes conformation from an "open" state to a "closed" 
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state during the rate-limiting step, and clamps down on the DNA and dNTP in preparation for 
chemistry. Further investigation revealed that the enzyme goes forward with this conformational 
change only when a correct dNTP (which forms a Watson-Crick base pair with the template) is 
bound to the polymerase [51,52]. In the T7 polymerase, for example, at saturating concentrations of 
dNTPs, the incorporation rate of a correct dNTP is higher than the misincorporation rate by a factor 
of 2000 - 4000 [34]. The chemistry step remains hostage to the rate-limiting conformational change, 
which in turn is sensitive to the "correctness" of the new base pair formed by the nucleotide and 
template, and is slowed down by an incorrect nucleotide in the active site. In addition, the chemistry 
of nucleotidyl transfer is inhibited, contributing to the slow rate of misincorporation [52]. A partial 
proteolytic mapping study of the T7 DNA polymerase has also shown that the polymerase changes 
conformation only when bound to the correct nucleotide [54]. Thus, DNA polymerase uses an 
"induced- fit" mechanism for fidelity of nucleotide incorporation, in which a slow step in the reaction 
(predicted as a change in polymerase from "open" to "closed" state) severely restricts nucleotidyl 
transfer in the presence of an incorrect dNTP and a mismatched base pair at the active site. In 
contrast, after the correct nucleotide is bound and the polymerase changes conformation, the 
chemistry step occurs rapidly and extends the primer by one nucleotide. The polymerase then goes 
through a second conformational change that likely returns it to an "open" state, leading to 
pyrophosphate release and translocation of DNA for the next cycle of dNTP incorporation (Fig. 2). 
The following sections describe particular structural features of DNA polymerase that play a critical 
role in the mechanism of DNA polymerization. 

The Structural Basis for Polymerase Action 
A) Catalysis 

As described earlier, the nucleotidyl transfer reaction in which the 3' - hydroxyl group of the 
primer attacks the cc-phosphate of the incoming dNTP occurs in the catalytic site located in the palm 
domain of the DNA polymerase. The active site, shown in Fig. 3A (for T7 DNA polymerase [37]), 
contains several acidic and polar amino acid residues as well as two metal cations (usually Mg 2 *) that 
are essential for the proposed mechanism of catalysis [19]. In particular, two aspartate residues are 
absolutely conserved between the polymerase families, and these provide the carboxylate oxygens 
that ligate the metal ions [39,43]. Ion A is located near the 3 f -hydroxyl group of the DNA primer and 
the cc-phosphate of the incoming dNTP. In this location, ion A is ideally positioned to lower the pK a 
of the hydroxyl group and facilitate formation of the hydroxide anion which can initiate nucleophilic 
attack on the cc-phosphate of dNTP (Fig. 3B). Metal ion A also ligates the a-phosphate of dNTP, 
helping to stabilize the pentacovalent transition-state formed during the reaction. Metal ion B ligates 
oxygens in all three phosphate groups of the dNTP, likely helping align the triphosphate moiety for 
attack by the 3'-hydroxyl, as well as stabilizing the charge on the transition state. Other polar residues 
in the active site and possibly ion B help stabilize the charged pyrophosphate group as it dissociates 
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from the polymerase after nucleotidyl transfer is complete. This two-metal ion mechanism for 
nucleotidyl transfer was originally proposed by Steitz, based on an analogous mechanism determined 
for the 3' - 5* exonuclease [55]. Remarkably, the catalytic mechanism and the geometry of the active 
site are highly conserved even in completely unrelated polymerases such as the T7 DNA polymerase 
[37] and Pol (3 [56], reflecting the evolutionary success of this mechanism for DNA synthesis. 

Fig . (3). The catalytic site. (A) The T7 DNA polymerase active site shows two metal ions (A and B) coordinated 
by conserved aspartate residues, a main chain oxygen, and water molecules, that in turn ligate the oxygens in the 
three phosphate groups of the incoming dNTP, (B) The 3' - hydroxyl group on the primer initiates nucleophilic 
attack on the a phosphate of the incoming dNTP, which results in phosphodiester bond formation between the 
primer and nucleotide and pyrophosphate release. Figure adapted from [37]. 

B) Selection of dNTP 

In addition to the layout of the core catalytic site, the crystal structures reveal several important 
features that allow DNA polymerases to synthesize DNA with high fidelity and processivity. The 
ability of a DNA polymerase to faithfully complement a DNA template depends on how well it 
selects for correct pairing between the template and the nucleotide during initial base pair formation, 
during extension of the base pair, and finally on the editing activity of the 3' - 5' exonuclease. The 
crystal structures of polymerases in complex with DNA and an incoming nucleotide reveal how a 
polymerase might initially select the correct nucleotide (e.g., T7 DNA polymerase [37], pol P [41], 
HIV-RT [57], and B. stearothermophilus polymerase [38], among others). As the template DNA 
strand enters the active site, it bends away from the cleft between the fingers and thumb and wraps 
around the fingers subdomain. This conformation exposes the next base in the template for pairing 
with the incoming dNTP (Fig. 4 for the T7 polymerase). The dNTP-binding site is located in a 
narrow junction between the fingers and thumb domains (Fig. 4 for T7 polymerase and Fig. 5D for 
Klentaql polymerase). The 3' - end of the primer lies right next to the dNTP-binding site, and 
together with residues from the fingers domain it forms a highly constrained binding pocket for the 
new base pair [37]. The T7 polymerase contacts the bases through numerous van der Waals' 
interactions and can therefore precisely recognize the geometry of the base pair. The DNA Pol p 
structure shows extensive hydrogen bonding between the polymerase active site and the new base 
pair, which also facilitates recognition [41]. Accordingly, DNA polymerases can select for correct, 
Watson-Crick base pairs and reject distorted, mismatched base pairs when the incoming dNTP 
initially fits into the binding pocket [58,59]. 

Fi g. (4). T7 DNA polymerase«DNA«dNTP complex (with thioredoxin). The structure of the T7 DNA polymerase 
shows the primer-template in the DNA-binding cleft, with the primer 3' - end held in the polymerase active site, 
and the template bent to expose the next base for pairing with the incoming dNTP. The fingers domain presses 
close to the active site to form a tight binding pocket for the nucleotide and the new base pair. The thumb 
domain contacts the duplex region of the primer-template in the minor groove and increases the polymerase's 
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grip on DNA. The thioredoxin processivity factor appears twisted away from the DNA, and is perhaps in an 
open conformation in this structure. Figure adapted from [37]. 

Biochemical studies of DNA polymerase mechanism describe an intramolecular rate-limiting step 
in the reaction, after initial dNTP binding to the polymerase-DNA complex and before its 
incorporation into DNA (Fig. 2) [34]. This step is predicted to be a conformational change that is 
induced by binding of the correct dNTP to the active site and results in a tightly bound 
Pol*_DNA_dNTP complex that is competent for catalysis. The first clues regarding the structural 
nature of this step were provided by crystal structures of the Pol p_DNA_dNTP ternary complex, the 
Pol p_DNA binary complex, and the apo form of Pol P (free of ligands) [41,56,60]. Compared to the 
latter two structures, Pol p complexed with nucleotide and DNA has a relatively "closed" 
conformation in which domain D (the fingers domain) lies very close to the newly formed base pair 
and appears sensitive to the geometry of its fit in the active site. The T7 polymerase structure in 
complex with a nucleotide and primed DNA also has a relatively closed conformation when 
compared with related Pol I family polymerases that are either unliganded or bound to DNA only 
[37,61]. In the T7 Pol_DNA_dNTP complex, the fingers domain (most prominently the O helix) is 
angled in toward the palm domain and DNA (Fig. 4), whereas in the other enzymes (apo form or 
Pol_DNA complexes) the fingers are angled away from the active site (~ 40_ difference between the 
two orientations), resulting in a more open conformation. The authors proposed that transition 
between these open and closed forms of the polymerase corresponds to the rate-limiting 
conformational change that DNA polymerase undergoes after binding the correct dNTP (Fig. 2) 
[37,61]. 

Most recently, a series of structures of a single polymerase, the Klenow fragment of Thermus 
aquaticus DNA polymerase (Klentaql) provided direct evidence for closure of the fingers domain 
toward the active site [62,63], Fig. 5A and SB show the structures of a Klentaql_DNA binary 
complex and a Klentaql *_DNA_dNTP 

ternary complex, respectively. The Klentaql *JDNA_dNTP structure was obtained from crystals that 
were analyzed immediately after growth in the presence of ddCTP and primed DNA, and thus 
represents a closed conformation in which the enzyme is associated tightly with both the DNA and 
the correct nucleotide (Fig. SB). The Klentaql_DNA structure was obtained by depriving the closed 
ternary complex crystals of DNA and ddCTP for a month, and thus represents a conformation in 
which the enzyme is bound only to DNA, having released the nucleotide (Fig. 5A). As expected from 
the earlier analysis of Pol P and Pol I family polymerase structures, the most striking difference 
between the two forms of Klentaql is in the conformation of the fingers domain. The Klentaql_DNA 
form is fully open, with the tip of the fingers domain (O helix) twisted away from the body of the 
enzyme such that a crevice is visible at the top of the cleft formed by the fingers, thumb, and palm 
domains (Fig. 5A). The Klentaql *_DNA_dNTP form is fully closed, such that the crevice is no 
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longer visible (Fig. 5B). A third structure of Klentaql was obtained by depriving ternary complex 
crystals of DNA and ddCTP for a few days only. This procedure yielded an intermediate ternary 
complex (Klentaql JDNA_dNTP) that contains nucleotide in the binding site, but has an open 
conformation similar to that of the KlentaqlJDNA binary complex. Superimposition of the two 
ternary complexes (Fig. 5C) highlights the inward movement of the fingers domain as the 
polymerase changes from open to closed conformation. In the closed conformation, the 0 helix lies 
much closer to the active site and helps orient the DNA template for base-pairing with the incoming 
dNTP (Fig. 5D). Furthermore, in the closed conformation the nucleotide is buried deep in the active 
site (in an environment favorable for chemistry), whereas in the open conformation, the nucleotide is 
readily accessible to the solvent (Fig. 5E). 

Fig. (5). Klentaql polymerase conformations. (A) An image of the open KlentaqlJDNA binary complex, with 
primed DNA bound in the cleft. The blue arrow predicts the motion of DNA during translocation. (B) The 
ternary Klentaql *»DNA»dNTP complex shows the inward movement of the fingers domain, which closes the 
crevice at the top of the DNA-binding cleft, trapping the DNA and nucleotide in the active site. (C) Stereo 
diagrams comparing the open (magenta) and closed (yellow) forms of Klentaql bound to DNA and dNTP. (D) 
and (E) Stereo diagrams of the active site, showing the contacts between the polymerase (O helix), DNA, and 
ddCTP in the closed form and open form, respectively. Note the displacement of Tyr 671 from its stacking 
arrangement with template base G T5 in going from the open form (E) to the closed form (D), which allows the 
next template base (G T4 ) to get in position for base pair formation with the ddCTP. Figure adapted from [62]. 

The structural data strongly suggest that the KlentaqlJDNA and Klentaql *_DNA_dNTP 
complexes respectively represent DNA polymerase conformations before binding dNTP and after the 
rate-limiting step, when the polymerase is predicted to clamp down on its substrates in preparation 
for chemistry (Fig. 2). The presence of the intermediate open Klentaql _DNA_dNTP complex 
implies that the enzyme can transition between binding dNTP, sensing it for "correctness" and 
releasing the wrong dNTP, before finally closing tightly around the right one. The new base pair 
must fit in a tight binding pocket (Fig. 4; Fig. 5D); therefore, the incompatible geometry of a 
mismatched base pair may trigger transition to an open conformation and dNTP release, while the 
correct fit of a Watson-Crick base pair allows the polymerase to proceed to the nucleotidyl transfer 
step. 

The nucleotide binding site in DNA polymerases can also discriminate between dNTPs and rNTPs 
(ribonucleoside triposphates) such that the polymerase synthesizes DNA not RNA. For example, in 
the T7 DNA polymerase, the C-2 carbon of the ribose moiety fits in a tight pocket that cannot 
accommodate the 2'-hydroxyl group of a rNTP [37]. Other polymerases utilize a "gating" mechanism 
to sterically block the 2 , -hydroxyl group of an incoming rNTP. In the Klenow fragment, for example, 
a glutamate residue in the nucleotide-binding site plays a key role in excluding the 2 5 -hydroxy 1 group 
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from the site [64]. A similar mechanism (using a phenylalanine residue) for blocking rNTP entry has 
been reported for the moloney murine leukemia virus reverse transcriptase [65]. Such intricate design 
features allow DNA polymerases to selectively synthesize DNA in spite of the 10-fold higher 
concentrations of rNTPs over dNTPs in the cell. 

C) Processivity 

The polymerase mechanism also predicts that after nucleotidyl transfer the polymerase switches 
back to an open conformation, from which the pyrophosphate and DNA products are released (Fig. 
2). In most polymerases, however, complete dissociation of DNA occurs very slowly (0.2 per second 
for T7 DNA polymerase [51]; Fig. 2). Instead, these enzymes function processively by translocating 
on DNA and repeating the polymerization reaction, thus extending the primer by more than one 
nucleotide per template binding event [7]. Replicative DNA polymerases in particular are highly 
processive, and employ accessory protein factors to replicate thousands of nucleotides per DNA 
binding event. The T7 polymerase bound to thioredoxin exhibits a processivity of - 1500 base pairs 
at 20 _C (processivity = rate of polymerization / rate of DNA dissociation) [51,66,67]. Other DNA 
polymerases, such as Pol I [47] and HIV-RT [49], do not utilize accessory proteins and yet are 
capable of incorporating several nucleotides per DNA binding event. Thus, the inherent design of the 
DNA polymerase plays an important role in processivity. The crystal structures of DNA-bound 
polymerases, such as the T7 DNA polymerase [37] and Klenow fragment [68] among others, reveal 
that the thumb domain forms a cap-like structure over the DNA binding cleft and covers a significant 
portion of the duplex. Residues in the tip of the thumb contact the primer-template upto several bases 
away from the active site, and appear to increase the grip of the polymerase on DNA [37]. Consistent 
with the structural data, removal of a 24 amino acid domain from the tip of the Klenow fragment 
thumb substantially reduces its affinity for DNA and also reduces its processivity [69]. Similarly, 
mutations in the thumb domain of HIV-1 reverse transcriptase increase the DNA dissociation rate 
and thus reduce processivity [70]. Simply the tight binding of polymerase to DNA solves only half 
the problem, since processivity also requires that the polymerase release DNA and translocate at the 
end of each catalytic cycle. The structures of Klentaql bound to DNA in Fig. 5A and 5B show the 
DNA almost completely enclosed in a cylinder-like groove formed by the fingers and palm and 
capped by the thumb [62]. In the closed conformation, the DNA is wedged tightly within the active 
site and does not move. In the open conformation the DNA is still enclosed in the groove, but may 
have the space to may move freely without complete dissociation from the polymerase. Thus, in the 
open conformation the polymerase can directly translocate to the next base on the template for 
processive DNA replication. 

D) DNA Editing 

On the rare occasions when a mismatch is incorporated into DNA, the primer strand partitions to 
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the 3' - 5' exonuclease site on the polymerase where the mismatch is rapidly excised [19,34]. In the 
crystal structures determined so far, the polymerase and exonuclease active sites are separated by a 
distance of about 30 - 40 A (e.g., Klenow fragment [4]; RB69 polymerase [39]. The DNA must 
move from the polymerase site and unwind a few base pairs to present the primer strand to the 
exonuclease site for editing. The structure of the RB69 polymerase JDNA editing complex shows 
how the DNA is partially melted and bent such that 3 - 4 bases of the primer enter a deep cleft, 
ending with the 3'-primer terminus in the active site (Fig. 6A) [71]. The exonuclease active site 
contains two metal ions that are ligated by acidic residues, and function analogously to the ions in the 
polymerase active site. One ion helps deprotonate a water molecule to generate the hydroxide ion for 
nucleophilic attack on the phosphorus in the phosphodiester bond. Both ions coordinate the oxygens 
of the phosphodiester bond, thereby stabilizing the geometry and the charge of the transition-state, 
and facilitating exit of the oxyanion nucleotide product. 

A terminal mismatched base pair in the primer-template (or even mismatches in the -2 or -3 
position) inhibits the next reaction cycle by blocking the rate-limiting conformational change in the 
polymerase (thereby stalling the polymerase in the "open" conformation), and by drastically slowing 
the rate of chemistry (300 per second to 0.01 per second for the T7 DNA polymerase) [52]. During 
this pause in the reaction, the DNA is released from the polymerase site and directed to the 
exonuclease site for correction. Mismatched base pairs distort the DNA duplex, and it has been 
proposed that their incorrect geometry may trigger translocation of the primer strand to the 
exonuclease active site for editing [34,72]. The T7 DNA polymerase structure reveals conserved 
amino acids in the fingers and palm domains that contact the final base pairs in the primer-template, 
and are in position to sense distortions in the minor groove caused by mismatches [37,61]. Mutations 
of analogous amino acids in the Klenow fragment reduce its affinity for the primer-template DNA 
and also slow the rate of DNA synthesis [73,74]. These results are consistent with the idea that 
mismatch recognition via a geometry-sensing mechanism changes the interaction between DNA 
polymerase and the primer-template, and may trigger conformational changes that stall the 
polymerase and allow the DNA to switch to the exonuclease site. 

A comparison of the RB69 polymerase_DNA editing complex with a model structure of DNA 
bound to the polymerase site shows how both the polymerase and DNA undergo changes during the 
switch from DNA synthesis mode to DNA editing mode (Fig. 6B) [71]. The double-stranded portion 
of the primer-template changes its trajectory by a 30_ angle as the primer strand is unwound and 
inserted into the exonuclease site. In the polymerase, the thumb domain shifts from a "closed" 
conformation (in the apo form) to a more "open" conformation, and a p hairpin structure that 
normally blocks access to the exonuclease is also shifted, allowing the primer clear access to the 
exonuclease active site. Furthermore, contacts between the P hairpin and the primer appear to 
stabilize the melted duplex and likely aid the entry of single-stranded DNA into the exonuclease and 
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facilitate DNA editing [75-78], Similarly, in the Klenow fragment, numerous hydrogen-bonding and 
hydrophobic contacts stabilize single-stranded DNA in the exonuclease site [55,79]. 

Fig . (6), DNA editing. (A) Structure of RB69 DNA polymerase with DNA bound to the editing site. The 
polymerase and exonuclease active sites are indicated by P and E, respectively, and are 38 A apart. (B) 
Comparison of the RB69 polymerase in editing mode (exo) versus the polymerase mode (pol). The p hairpin 
partitioning the two active sites is highlighted. Figure adapted from (71j. 

Since single-stranded DNA is the preferred substrate of the exonuclease, the stability of the DNA 
duplex likely influences the transfer of DNA between the polymerase and exonuclease active sites 
(T4 DNA polymerase [53,80]; E. coli Pol EI [81]; Klenow fragment [82,83]). Mismatches disrupt the 
hydrogen bonding between base pairs and increase the melting capacity of DNA, which in turn may 
increase the potential for DNA editing [84,85]. Biochemical studies examining the influence of base 
pair hydrogen bonding on polymerase and exonuclease activity utilize nucleoside analogs that mimic 
the shape of native dNTPs but lack the ability to form hydrogen bonds [86,87]. When paired with 
natural bases in the terminal base pair of a primed DNA template, these analogs mimic Watson-Crick 
base pair geometry but form unstable base pairs because of the lack of hydrogen bonding. A recent 
study shows that the weak T : Z (4-methylbenzimidazole) and A : F (2,4-difluorotoluene) base pairs 
are edited with similar efficiency as mismatched base pairs such as T : C or A : A (in spite of their 
resemblance to the correct A : T base pair structure), while the correct, hydrogen bonded A : T base 
pair is edited at a significantly slower rate by the Klenow fragment [88]. These results further 
substantiate the hypothesis that base pair instability (as in a mismatch) and consequent DNA melting 
stimulates exonuclease activity and therefore has a strong influence on the editing mechanism. 

Once single-stranded DNA binds to the exonuclease site it can be hydrolyzed very rapidly, so the 
question arises as to how the enzyme prevents extensive and unecessary degradation of DNA. The T7 
polymerase mechanism indicates that DNA can easily slide back and forth between the two active 
sites, and that binding of the primer-template to the polymerase active site is energetically favored 
over binding to the exonuclease [34,72]. In other words, the DNA has a very small lifetime at the 
exonuclease site compared to the polymerase site. Consequently, DNA excision is not very 
processive, and after a mismatch is corrected, the DNA can exit rapidly from the exonuclease. The 
polymerase site rebinds the corrected DNA with high affinity and returns to the task of DNA 
synthesis. Studies of the T4 [89], herpes simplex virus [90], and <>29 [91] polymerases suggest that 
these enzymes also allow DNA to slide freely between the two active sites. Whether this property 
maintains low exonuclease processivity and controlled DNA editing in other polymerases, as in the 
T7 polymerase, remains to be seen. Ongoing studies of different polymerases continue to clarify 
structural and kinetic parameters governing polymerase and exonuclease activity, and are anticipated 
to provide greater insights into the mechanisms by which polymerases replicate DNA with high 
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fidelity. 

HOLOENZYME STRUCTURE AND FUNCTION 

During genomic DNA replication, several different proteins function at the DNA replication fork as 
part of a replisome (Fig. 7) [7,8]. A DNA helicase unwinds duplex DNA to create a single-stranded 
template for the polymerase [92]. Single-stranded DNA binding protein (SSB) stabilizes the template 
strands [93], while primase synthesizes short RNA primers on DNA that serve as start sites for DNA 
replication [1]. The polymerase catalyzes DNA synthesis and proofreads the new DNA strand with 
its y - 5' exonuclease activity, as described above. Genomic DNA replication additionally requires 
that the DNA polymerase function with high speed and fidelity, as well as exceptionally high 
processivity. Moreover, the polymerase must also coordinate leading and lagging strand DNA 
replication, which is a challenging task given that the leading strand is synthesized continuously in 
the direction of the replication fork and the lagging strand must be synthesized discontinuously in the 
opposite direction, due to the anti-parallel nature of DNA [1]. Most polymerases acquire the 
specialized properties required for genomic DNA replication by associating with one or more 
accessory proteins. These proteins work together with the polymerase in a holoenzyme particle that is 
capable of simultaneous replication of leading and lagging strand DNA [94], at close to 1000 
nucleotides per second [94], and with an error rate approaching 1 in 10 7 base pairs [95]. The 
following sections focus on the composition, architecture, and the mechanisms of action of DNA 
polymerase holoenzymes, using E. coli DNA polymerase III as a model system to understand the 
workings of these multi-protein machines in E. coli and other organisms. 

Anatomy of a Holoenzyme 

DNA polymerase holoenzymes generally contain three distinct components: a) a core DNA 
polymerase, b) a sliding clamp processivity factor, and c) a clamp loader. These components appear 
conserved in all cellular organisms ranging from bacteria to humans (Table I). The DNA polymerase 
III holoenzyme (Pol HI) is made up of ten proteins, each of which contributes a different activity 
toward DNA replication (Fig. 7) [24]. The core DNA polymerase (oceG) comprises: a, the DNA 
polymerase, e, the 3' - 5' proofreading exonuclease, and 9, whose function is unclear. Like most 
other polymerases, the core DNA polymerase has very low processivity and dissociates from the 
DNA after incorporating only about 10 to 20 nucleotides [96]. This problem is alleviated by the p 
protein, a sliding clamp that binds the polymerase with high affinity and increases its stability on 
template DNA [96,97]. Pol III also contains a 5-protein clamp loader, y complex, that assembles the 
p processivity factor onto DNA for use by the core polymerase [98]. The tenth protein, x, holds 
together two core polymerases, thus facilitating simultaneous synthesis of leading and lagging 
strands by the holoenzyme [99]. x also binds the y complex, serving as a connector that holds the 
polymerase and clamp loader in one holoenzyme particle [99]. Furthermore, x interacts with the 
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DNA helicase (DnaB [100,101]) and the primase (DnaG [100]), stimulating their activity and helping 
coordinate holoenzyme function with other replisomal proteins at the DNA replication fork (Fig. 7). 

Fig. (7). The DNA replisome. At the DNA replication fork, the helicase unwinds duplex DNA to make single- 
stranded template, which is stabilized by SSB, and primase synthesizes an RNA primer on the template for 
initiation of replication. The holoenzyme, comprising two core DNA polymerases (<X80), a connector protein (t), 
a clamp loader (y complex), and two sliding clamps (P), replicates leading and lagging DNA strands 
simultaneously. (A) The primase maintains its grip on the new primer through an essential contact with SSB. (B) 
The % subunit of y complex severs the primase-SSB contact as the clamp loader takes its place on the primer and 
assembles the circular p clamp around it The primase recycles to another site on the template to continue 
synthesizing primers. (C) When the lagging strand polymerase finishes an Okazaki fragment, it releases the 
clamp and DNA, and reassembles at the upstream primed site (with P on it) to start a new fragment. Figure 
adapted from [139]. 

Eukaryotic DNA polymerase holoenzymes have similar components, including Pol 8 (and Pol s?) 
the replicative DNA polymerase, PCNA, the sliding clamp, and RFC, the clamp loader [102]. So far, 
there appears to be no equivalent to the E. coli x protein connector, although, the RFC subunits share 
significant sequence homology with x [103], and at least one RFC subunit appears to bind Pol 8 
[104,105]. The three eukaryotic holoenzyme components may also interact with each other directly to 
form a stable holoenzyme particle. The 5. cerevisiae Pol 8 (a three subunit polymerase) forms a 
dimer, which is likely responsible for simultaneous replication of leading and lagging DNA strands 
[106]. Similarly, the four-subunit Schizosaccharomyces pombe DNA Pol 8 forms a dimer, which may 
serve the same function [107]. The PCNA clamp is bound tightly by Pol 8 during DNA replication, 
and studies with the human DNA polymerase indicate that RFC also associates with the holoenzyme 
through direct interaction with Pol 8 [104,105], possibly helped by a contact with RPA (the 
eukaryotic SSB) on DNA [105]. The Cdc45 protein, which interacts with the eukaryotic replicative 
helicase MCM [108], and moves with the replication fork [108], may also be involved in organizing 
the eukar yotic replisome. 

Table I. Components of the DNA Replisome 

Processivity Factors 

A) Circular Sliding Clamps 

Defined as the ratio of the DNA polymerization rate versus the DNA dissociation rate, processivity 
reflects the ability of a polymerase to extend DNA per template binding event. Replicative DNA 
polymerases tend to be highly processive and can extend a new strand by several thousand 
nucleotides without falling off the template [1]. Studies of DNA replicases from various organisms 
have revealed different mechanisms by which polymerases acquire processivity. By far the most 
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common mechanism involves the use of "sliding clamps" as accessory factors. Sliding clamps are 
ring-shaped proteins that encircle double-stranded DNA and form a topological link with it [110]. 
This stable yet nonspecific, sequence-independent interaction with DNA allows the ring to slide 
freely along the duplex. These clamps also bind DNA polymerase, and thus serve as mobile tethers to 
keep the polymerase associated with the template during DNA synthesis. 

The term "sliding clamp" was first used to describe the ability of bacteriophage T4 polymerase 
accessory factors (gp45 clamp in concert with gp44/62) to stimulate DNA replication [111-112]. 
Early biochemical studies with the E. coli Pol III holoenzyme suggested that one of its accessory 
subunits, p, is in fact the sliding clamp— a ring-shaped protein that encircles DNA [97]. The 
experiments showed that p assembled on a circular DNA molecule remains stably bound, but if the 
DNA is linearized with an endonuclease, the clamp dissociates rapidly and completely from the DNA 
[97]. It was hypothesized that the circular clamp slides freely on DNA and can therefore fall off the 
ends off a linear DNA molecule. This hypothesis was confirmed by the crystal structure of P, which 
revealed a protein ring about 80 A in diameter with a central hole about 35 A across (Fig. 8A) [113]. 
Two P monomers interact in a head-to-tail fashion to form a tightly closed ring with no apparent 
breaks at the interfaces. The central hole is more than large enough to accommodate A- or B-form 
DNA, and positive electrostatic potential on the inside of the clamp complements the negatively 
charged phosphate backbone of DNA. The absence of specific DNA binding elements inside the ring 
suggests that P interacts with the double helix through non-specific, water-mediated interactions, and 
explains how it can slide freely on DNA [1 10,1 13]. 

Soon after the E. coli clamp structure was solved, the crystal structures of sliding clamps from 5. 
cerevisiae (yPCNA) [114], humans (hPCNA) [115], bacteriophage T4 (gp45) [116], and 
bacteriophage RB69 [71] were determined (Fig. 8A). The proteins form remarkably similar tertiary 
and quaternary structures despite the lack of significant similarity between their primary amino acid 
sequences (except for the T4 and RB69 clamps, which are closely related). All the clamps are highly 
symmetrical toroids, with similar dimensions and a central hole of about 30 - 35 A in diameter. The 
most striking difference between the clamps is that PCNA and the T4 and RB69 monomer units are 
about two-thirds the size of p, therefore three monomers interact with each other to form the clamp 
instead of two as in p. All other structure-function aspects of the clamps appear closely related; the 
monomers form an unbroken ring that can encircle duplex DNA, and each monomer consists of two 
(PCNA, T4, RB69) or three (p) domains with repeating motifs that maintain the ring structure, as 
well as inter-domain connector loops that are important for interactions with other proteins. As in p, 
the inside channel of the rings is wide, positively charged, and lined with a helices that do not make 
any specific contacts with DNA, allowing the clamps to slide freely on the duplex behind the 
polymerase (Fig. 8A). 

Fig. (8). Sliding clamps and their dynamic interactions with DNA polymerase. (A^Front views of circular sliding | 
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clamps from bacteriophage T4 (gp45), bacteriophage RB69, E. coli (p), S. cerevisiae (yPCNA), and humans 
(hPCNA). (B) A model structure of the RB69 DNA polymerase tethered to primer-template DNA by the sliding 
clamp. {Q and (D) Models illustrating how the E. coli DNA polymerase associated with its P sliding clamp might 
bypass a secondary structure on DNA by sliding over it (C) or remove it by a strand displacement mechanism 
(D). Figure adapted from [71,120], 

Recently, the bacteriophage RB69 clamp structure was solved in complex with a carboxyl-terminal 
peptide from the RB69 DNA polymerase (Fig. 8B). This 11 -residue, C-terminal peptide is essential 
for interaction between the polymerase and clamp (in both phage RB69 and T4), and is likely the 
major point of contact between the two [117-119]. The structure of the DNA polymerase shows the 
C-terminus extending out from behind the main body of the enzyme, much like a tether that hooks the 
polymerase onto the sliding clamp (Fig. 8B) [39,71]. Hydrophobic residues among the last seven 
amino acids of the tether form tight contacts with a small hydrophobic pocket on the sliding clamp, 
while the remaining residues extend away from the clamp toward the main body and are structurally 
more flexible. This type of connection may be of advantage when the polymerase encounters pause 
sites during DNA replication and has to adjust its position on DNA [120]. For example, following 
misincorporation of a nucleotide, the polymerase pauses while the DNA switches between the 
polymerase active site and editing site. A flexible tether between the polymerase and clamp could 
facilitate such movements while minimizing the danger of the polymerase dissociating from the DNA 
altogether. In a recent study, the p clamp, as well as human PCNA, were found capable of sliding 
over secondary structures such as single-stranded loops, small stem-loops, and bubbles in DNA 
[121]. Furthermore, when the Pol in holoenzyme encountered such small obstacles, it did not fall off 
DNA but rather traversed the obstacle (possibly by piggybacking on the clamp) and resumed 
synthesis at a downstream 3 '-terminus (Fig. 8C). Given secondary structures that were too large for 
the clamp to slide over, the polymerase could clear them by a strand-displacement mechanism or 
eventually dissociate from DNA (Fig. 8D) [121]. In vivo, such mechanisms may avert complete 
dissassembly of the holoenzyme at every pause site and facilitate rapid DNA replication, while repair 
processes fix the anomalous structures in DNA at a later time. Although structural details of the 
interaction between P and the core DNA polymerase are not yet clear, a flexible connection, similar 
to that detected in phage RB69, might allow the polymerase to shift position on DNA as it attempts to 
traverse or displace obstacles in its path. 

B) Monomeric Tethers 

Several replicative DNA polymerases acquire processivity through means other than the circular 
sliding clamps described above. One such enzyme is the bacteriophage T7 DNA polymerase that co- 
opts a bacterial host protein, thioredoxin, to increase its processivity from a few nucleotides to more 
than a thousand nucleotides per template binding event [122,123]. Thioredoxin functions as a 
monomer and forms a 1 : 1 complex with the polymerase by interacting with its extended thumb 
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domain, as shown in Fig. 4 [37]. The thumb domain covers a substantial portion of the DNA binding 
cleft and stabilizes the polymerase on DNA. It has been hypothesized that thioredoxin bound to the 
thumb further extends the "cap" structure and converts the enzyme itself into a clamp, thus increasing 
its grip on DNA [61]. Although the thioredoxin molecule in the crystal structure does not cap the 
DNA binding cleft (Fig. 4), this extended conformation may represent an open form of the enzyme, 
which is required for initial interaction with DNA and for DNA release after the reaction is complete. 
It is also possible that thioredoxin does not trap DNA within the polymerase, but rather binds DNA 
through non-specific electrostatic contacts, which allow it to slide on DNA and thus function as a 
mobile tether for the polymerase. 

Prominent among other polymerases that utilize monomeric processivity factors are the eukaryotic 
mitochondrial DNA polymerase y, which is composed of a large catalytic subunit and a smaller 
accessory subunit that increases its processivity [124,125], and the herpes simplex virus DNA 
polymerase, which utilizes the UL42 protein for high processivity [126]. A model structure of the 
DNA polymerase y accessory subunit predicts that it shares some structural similarity with 
thioredoxin, but there is no direct evidence yet that polymerase y employs the same mechanism for 
processivity as the T7 DNA polymerase [127]. The UL42 protein is even more of a mystery, since a 
recent report shows that its structure is very similar to that of a PCNA monomer (Fig. 8A), despite 
the lack of any sequence homology between the two [128]. Furthermore, a C-terminal peptide of the 
HSV DNA polymerase is critical for binding to UL42 [129], suggesting that interaction between the 
polymerase and UL42 is through a flexible tether, much like that observed for the RB69 polymerase 
and clamp [71,128], Yet unlike the RB69 clamp and PCNA, UL42 functions as a single monomer in 
a 1 : 1 complex with the HSV DNA polymerase [126]. Although it is not yet clear how the UL42 
structure corresponds with its function as a processivity factor, it binds duplex DNA with high 
affinity [130,131], and may utilize electrostatic interactions to slide on DNA, as has been proposed 
for thioredoxin. The UL42 protein may reflect a precursor form of the circular sliding clamps as its 
structure strongly resembles that of a sliding clamp subunit but its action appears more primitive 
(without formation of a multimeric ring or topological linkage with DNA). 

Differences in the structures of various processivity factors are mirrored by differences in their 
mechanisms for conferring processivity on DNA polymerases. As described earlier, processivity is a 
function of two factors: the propensity of the polymerase to extend DNA versus its propensity to 
dissociate from DNA. In keeping with the proposed capped structure of T7 DNA polymerase and 
thioredoxin on DNA, a study of T7 polymerase-catalyzed DNA synthesis indicates that thioredoxin 
causes a significant decrease in the dissociation of the polymerase from DNA [51,122]. However, 
thioredoxin also increases polymerase processivity by increasing the rate of DNA polymerization. It 
should be interesting to determine how an accessory protein bound so far from the catalytic site 
influences the rate of polymerization (Fig. 4). UL42 also reduces the rate of HSV polymerase 
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dissociation from DNA, but appears to have no effect on the rate of polymerization [131]. In contrast, 
the 8-fold increase in overall processivity of human mitochondrial DNA polymerase y in the presence 
of its accessory subunit is almost entirely due to a substantial increase in the polymerization rate 
[132]. The accessory subunit has no effect on the dissociation of y polymerase from DNA, implying 
that it does not function by physically trapping the polymerase on DNA, as suggested for thioredoxin 
and the circular sliding clamps. The diversity in structures and mechanisms for processivity even 
among the few DNA replicases examined in detail suggests that several different methods have 
evolved to effectively meet the essential requirement of rapid genomic DNA replication by DNA 
replicases. 

Circular Clamp Assembly on DNA 

A) Clamp Loader Structure 

The crystal structure of a sliding clamp provided a clear explanation of how it serves as mobile 
tether for the DNA polymerase. It also raised an intriguing question; how does the closed protein ring 
form a topological link with DNA? In order to serve as processivity factors for DNA polymerase, 
sliding clamps must encircle primed DNA sites that serve as starting points for DNA replication. 
These proteins cannot self-assemble around DNA and must be loaded onto DNA by a protein 
complex called the clamp loader (Fig. 7). The clamp loader functions as a molecular matchmaker, 
using energy from ATP binding and hydrolysis to bring together a clamp and primed DNA and form 
a topological link between the two [133]. DNA replicases from a variety of organisms ranging from 
bacteriophage to humans utilize such ATP-fueled molecular matchmakers for processivity (Table I). 

In E. coli, early studies of the Pol III holoenzyme had shown that 2 ATP molecules are hydrolyzed 
during formation of an initiation complex in which the polymerase is tethered to primed DNA by the 
p sliding clamp [134]. More recently, the E. coli clamp loader, y complex, was found to hydrolyze 2 - 
3 ATP molecules for assembly of one p clamp on DNA, confirming that its ATPase activity is 
directly coupled to formation of a processive DNA polymerase [135]. The E. coli clamp loader is a 
complex machine composed of two to four y, one 5, one 5', as well as one each of the % and y 
subunits [98,136]. The three subunits, y, 8, and 5', are essential for clamp assembly (Fig. 9A) [137]; 
the % subunit aids holoenzyme assembly onto primed DNA [138,139] (discussed in a following 
section), and the function of \\f is unknown, although it appears to stabilize the y complex under high 
salt conditions [140]. The basis for clamp assembly lies in 5, which is the only subunit that binds p 
and is capable of opening the ring without help from the other subunits or a nucleotide cofactor 
[135]. However, when 5 is assembled within y complex, it has very low affinity for p and cannot 
open the clamp in the absence of ATP. Under these conditions, the p binding site on the 8 subunit is 
buried within y complex, likely occluded by the 8' subunit. In the presence of ATP, the y complex 
undergoes a conformational change that allows 8 to bind p [135,141]. The crystal structure of the 8' 
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subunit provides some clues as to the nature of this conformational change. 5' is a "C"-shaped 
protein with globular top and bottom domains connected by a flexible hinge domain [142]. The y 
subunit shares extensive sequence homology with the 5' subunit, and a model structure of y based on 
8' indicates that y is also "C'-shaped (Fig. 9B). The y subunit contains the Walker A and B ATP 
binding motifs, and is the only subunit in y complex that binds and hydrolyzes ATP [143,144]. The 
ATP-binding site abuts the hinge domain in y, predicting that ATP binding and hydrolysis affect the 
flexible hinge and drive movement of the top and bottom domains relative to each other [142]. Thus, 
y is the "motor" subunit in y complex, and clamp assembly begins when at least two y subunits bind 
ATP, resulting in a change in y complex conformation (Fig. 9A) [145]. This conformational change 
moves the 8' subunit and exposes 8 for interaction with p. A study of p clamps covalently cross- 
linked at one or both interfaces of the dimer indicates that opening of one interface is necessary and 
sufficient for loading p onto DNA. Thus, on ATP binding, the 8 subunit likely opens the p clamp at 
one interface in preparation for clamp assembly (Fig. 9A). 

Prior to ATP hydrolysis, the ATP-bound clamp loader also binds primed DNA to form an 
intermediate complex in which the open clamp and DNA substrate are held in close proximity, 
possibly in position to form a topological link with each other [145]. Completion of clamp assembly 
requires ATP hydrolysis, as ATPyS (an ATP analog not hydrolyzed by y complex) cannot substitute 
for ATP in clamp assembly, p and DNA binding experiments performed with ATP and ATPyS show 
that the ATP-bound form of y complex has high affinity for p and DNA, but both substrates are 
released when ATP is hydrolyzed [145]. A minimal model pathway for clamp assembly suggests that 
once the clamp loader_ATP_open clampJDNA intermediate is formed, ATP hydrolysis reverts y 
complex conformation back to the original state where 8' once again binds 8 and disrupts its 
interaction with p. Consequently, p and DNA are released from y complex, topological^ linked to 
each other [135,145]. Investigation of y complex activity using rapid kinetic techniques indicates that 
coupling between the ATPase reaction and clamp assembly is a bit more complex than the simple 
two-step model mechanism outlined above [146], The y complex binds two ATP molecules rapidly, 
at an apparent bimolecular rate of 4 x 10 5 M" 1 second 1 (Fig. 9A). Interaction with primed DNA also 
occurs rapidly, on the order of 10 8 M" 1 second" 1 (measured with y complex alone [147]) and may 
occur before or after interaction with p. Next, a slow, intramolecular step occurs at 12 - 15 per 
second, and this has been attributed to opening of the p ring. Following ring opening, the two ATP 
molecules are hydrolyzed sequentially by y complex. Hydrolysis of one ATP occurs at > 80 second' 1 
and is followed by a rate-limiting step in the reaction. Thus, the second ATP is hydrolyzed much 
slower than the first, at an apparent rate of 2 second" 1 . Hydrolysis of one ATP molecule appears 
sufficient for release of P from the y complex, although it is not clear yet whether p release occurs 
after the first or the second ATP is hydrolyzed [146]. The sequential mechanism is indicative of 
multi-step coupling of ATP hydrolysis to clamp assembly, and the initial evidence suggests that each 
hydrolysis step is coupled to a distinct functional event in the pathway. Once the clamp 
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loader_ATP_open clamp_DNA intermediate is formed, the clamp loader may need to position the 
clamp, or DNA, or both, such that the DNA lies in the center of the open clamp. Further 
rearrangements may be necessary to close the clamp around DNA, release the clamp and DNA, and 
for the y complex to revert back to its original conformation. These events are coordinated and likely 
coupled in a step-wise fashion to sequential ATP hydrolysis and release of ADP and P. from y 

complex. 

Clamp loaders from other organisms are also multi-component machines that utilize energy from 
ATP binding and hydrolysis to assemble circular clamps on DNA (Table I). The eukaryotic clamp 
loader, Replication Factor C (RFC), contains five subunits that are highly conserved from yeast to 
humans (S. cerevisiae RFC1 - 5; human pl40, p37, p36, p40, p38) [8,148]. All five proteins also 
share high homology with the y and 5' subunits ofE. colU the gp44 subunits of the bacteriophage T4 
clamp loader, gp44/62, as well as RFC1 and RFC3, the two subunits of the archaeal clamp loader 
[23,103]. The sequence similarities imply that these clamp loader proteins have a 'C'-shaped 
structure like y and 8', and might employ this flexible, ATP-sensitive shape like y complex for the 
mechanics of opening the circular clamp and closing it around DNA. Another striking similarity 
between y complex and clamp loaders from other organisms is that they all contain several ATPase- 
active subunits. The most highly conserved motifs in all the proteins listed above are the classic 
Walker A and B sequences, which form the phosphate-binding loop (P loop) and Mg 2+ binding sites, 
respectively, and are required for ATP binding and hydrolysis (Fig. 9B). At least four of the five 
human RFC proteins contribute to RFC ATPase activity, since mutations in their ATP-binding sites 
reduce PCNA loading activity and processive DNA replication [149]. Multiple yeast RFC subunits 
also exhibit ATP binding/ATPase activity [150,151]. The bacteriophage T4 clamp loader contains 
four copies of the gp44 subunit [152], and the archaeal clamp loader mthRFC, isolated from 
Methanobacterium thermoautotrophicum AH, contains several copies of the RF1 and RFC3 subunits 
[23]. This organization of multiple ATPase-active subunits in clamp loaders suggests that the 
sequential ATPase mechanism employed by y complex may generalize to clamp loaders from other 
organisms. 

The bacteriophage T4 clamp loader, gp44/62, catalyzes assembly of the circular gp45 clamp onto 
DNA in a reaction driven by binding and hydrolysis of four ATP molecules [153,154]. Like y 
complex, gp44/62 binds the gp45 clamp on binding ATP [155,156]. The clamp must open wide for 
assembly around primed DNA, and different studies suggest that clamp opening occurs prior to ATP 
hydrolysis (as seen for y complex and p) or concurrent with the hydrolysis of two ATP molecules 
[157,158]. The gp44/62_gp45 complex also binds DNA, after which hydrolysis of the other two ATP 
molecules appears to drive conformational changes in gp44/62 and close the gp45 ring around DNA 
[159]. This proposed mechanism is similar to that of y complex in that the ATPase activity appears 
sequential (two ATP molecules instead of one are hydrolyzed in each step), but the manner in which 
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ATP hydrolysis is coupled to clamp assembly may be different. Further details on both gp44/62 and y 
complex activities should facilitate better comparison of the two clamp loaders. 

Fi g. (9), Clamp loader structure and mechanism. (A) A model structure of y, based on the 6' crystal structure, 
shows the "C"-shape of the protein with a top and bottom domain connected by a hinge domain. The ATP site in 
y lies in close proximity to the hinge domain, implying that ATP binding and hydrolysis might induce global 
changes in the shape of the protein. (B) A model mechanism of ATP hydrolysis-coupled clamp assembly shows 
that at first two y subunits in y complex rapidly bind ATP. The ensuing change in y complex conformation 
exposes 8, resulting in interaction with P and slow opening of the P ring at one dimer interface. The ATP-bound 
clamp loader also binds primer-template DNA rapidly. Next, sequential ATP hydrolysis followed by ADP and P 

dissociation drive the step-wise rearrangement and release of P and DNA, resulting in p linked topological^ 
around DNA. The arrows indicate that changes in p or DNA may be coupled to the first or second ATP 
hydrolysis step. Figure adapted from [142,146]. 

In eukaryotes, ATP binding to RFC facilitates stable interaction between RFC, PCNA, and DNA, 
and ATP hydrolysis results in a PCNAJDNA complex that is used effectively by the polymerase for 
processive DNA replication [160,161]. Like the 8 subunit in y complex, the p40_p37_p36 
subcomplex of human RFC, and possibly the p40 subunit alone, is capable of opening the PCNA ring 
in the absence of ATP [162]. Applying the y complex model mechanism for clamp loading to RFC 
suggests that ATP binding might induce a change in RFC conformation that exposes the p40 or 
p40_p37_p36 subcomplex and allows it to bind and open PCNA. Subsequent ATP hydrolysis would 
then drive formation of a topological link between PCNA and DNA, release of PCNAJDNA, and 
turnover of RFC, although these steps have not yet been examined individually. In spite of the overall 
similarities between RFC and y complex, their mechanisms may differ in detail. For example, in 
addition to the p40_p37_p36 subcomplex, the large pl40 subunit also binds PCNA [163,164], and 
this interaction may be required for both ring opening (measured as removal of clamps from circular 
DNA) and clamp assembly [162]. Moreover, unlike E. coli 5, the p40jp37_p36 subcomplex also has 
a DNA-stimulated ATPase activity, which could be important for coupling ATP hydrolysis to 
assembly of the clamp around DNA [162,165]. The role of each subunit (and its ATPase activity) in 
the mechanics of PCNA loading is still under investigation. Extensive genetic and in vitro studies of 
the yeast and human clamp loader proteins also suggest that RFC is involved in cell cycle cycle 
checkpoint pathways [148]. For example, the large RFC subunit (pl40) can be phosphorylated by 
protein kinases such as PK II, and the phosphorylated form is less active in DNA replication [166], 
This has led to speculation that the clamp loading activity of RFC may be modulated by mechanisms 
that control progression of the cell cycle. Site-specific cleavage of pi 40 by proteases such as caspase- 
3 has also been detected, implicating RFC activity in apoptosis-linked pathways [167,168]. Future 
studies are anticipated to not only clarify the mechanism of RFC-catalyzed clamp assembly but also 
shed light on how RFC activity is linked to cell cycle control mechanisms. 
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Holoenzyme Assembly on DNA 

Clamp assembly on DNA is only one event in an ordered series of reactions that occur during 
assembly of the holoenzyme on DNA and the start of DNA replication (Fig. 7). In the beginning, the 
DNA duplex is unwound by a helicase [92] and single-stranded (ss) DNA binding protein (SSB) 
stabilizes the resulting ss-DNA templates [93]. Recent studies have revealed that SSB plays a far 
more complex role in DNA replication than simply binding to ss-DNA [138,139]. In fact, in both 
bacterial (E. coli) and eukaryotic (human) DNA replication systems, it plays a central role in 
coordinating the interactions between various proteins during assembly of a functional polymerase 
holoenzyme on DNA [105,139]. 

The primase (E. coli DnaG) targets to single-stranded DNA, possibly aided by interaction with the 
DnaB helicase [169] and synthesizes small RNA primers up to 12 nucleotides in length (Fig. 7 A). 
The primase remains tightly associated with the primer on DNA, stabilizing the small primer on 
DNA and protecting it against nucleases [139]. Next, the primase is displaced from the primer by the 
clamp loader (y complex), which has to assemble the P clamp on primed DNA (Fig. 7B). This 
switching of proteins at the primed DNA site occurs through a competition between the primase and 
the clamp loader for the SSB protein. After synthesizing the RNA primer, the primase remains on the 
RNAJDNA aided by stable interaction with SSB coated on the adjacent ss-DNA (Fig. 7A). A single 
subunit of the clamp loader (%) also binds SSB and by virtue of this interaction it destabilizes the 
primase-SSB complex, resulting in displacement of the primase from DNA. The released primase can 
now recycle to another site on ss-DNA, while the y complex proceeds to load p at the primed site 
(Fig. 7B) [139]. Clamp assembly is followed by another protein switch between the y complex and 
the polymerase [170]. Both proteins compete directly for the same binding site on the P clamp. In 
solution, y complex binds the clamp with higher affinity than the polymerase, but releases the clamp 
after loading it on primed DNA. In contrast, the polymerase has weak affinity for p in solution, but 
binds the clamp tightly once it is loaded on DNA. Thus, after y complex completes p assembly on 
DNA and leaves, the polymerase can finally access the primer and clamp and initiate DNA 
replication [170]. 

Similar "protein switching" mechanisms for polymerase assembly have been discovered in the 
eukaryotic DNA replisome as well [102,105]. In the human system, polymerase/primase a 
synthesizes an RNA primer of about 35 - 50 bases on a single-stranded DNA template [171]. As 
observed in E. coli, Pol a remains firmly attached to the primed DNA site through interaction with 
RPA, the eukaryotic single-stranded DNA binding protein. In the first protein switch, the clamp 
loader, RFC, binds RPA and disrupts the RPA-Pol a interaction, resulting in displacement of Pol a 
from the DNA. Subsequently, RFC assembles a PCNA clamp on the primed DNA, but unlike y 
complex, it remains bound to RPA and the PCNAJDNA complex [105]. In the second switch, the 
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replicative DNA polymerase 8 competes with RFC for PCNA (Pol 6 and RFC bind the same site on 
PCNA [172]) and RPA, and gains access to PCNA_DNA to complete holoenzyme assembly and 
initiate DNA synthesis. As in E. coli, the continuous presence of proteins on the primer (first Pol a, 
then RFC, and then Pol 8) protects it from degradation by nucleases. The pol a to RFC to Pol 8 
switch may also impose specificity to the replication reaction, as polymerases other than Pol 8 (e.g., 
Pol I tested in this study) may not be capable of extending a primer bound by RFC [105]. 

Holoenzyme assembly occurs infrequently on the leading DNA strand since it is replicated in a 
continuous fashion (Fig. 7). On the lagging strand, however, the polymerase must finish an Okazaki 
fragment and restart a new fragment once every second or so [1,7]. When the lagging strand 
polymerase hits the end of an Okazaki fragment, it releases both the sliding clamp and DNA and has 
to be reassembled at an upstream primed DNA site as observed, for example, with the DNA 
polymerase HI holoenzyme [100] (Fig. 7B). As described above, holoenzyme assembly involves a 
series of reactions with changing protein-protein and protein-DNA interactions until the polymerase 
is finally in place on DNA. Completion of one reaction sets up another reaction, apparently because 
the next protein component has high affinity for the product of the previous reaction. The ordered 
nature of these reactions helps coordinate rapid cycling of the lagging DNA polymerase and thus 
ensures smooth and timely progression of the DNA replication fork. 

General Application of the Principles of Holoenzyme Assembly 

Protein switching mechanisms may operate in other situations during DNA replication, and 
possibly even in other DNA metabolic processes where several different proteins compete for one 
site of action. One likely candidate for a central role in protein switching mechanisms is the circular 
sliding clamp. Several enzymes, including the replication- and repair- specific endonucleases, Fenl 
[173] and XPG [174], respectively, DNA ligase I (that seals nicks in DNA) [175], and DNA 
cytosine-5 methyl transferase (that methylates newly synthesized DNA) [176], bind the eukaryotic 
clamp, PCNA. All these enzymes contain the PCNA binding motif (a small, conserved sequence of 
hydrophobic amino acids similar to the RB69 polymerase C-terminal peptide that binds the clamp; 
Fig. 8B), and presumably compete with each other and with DNA polymerase for binding to PCNA 
[177,178]. Ordered competition for the sliding clamp may impart a temporal sequence to the 
assembly and action of these enzymes in various DNA processing reactions. For example, Fenl (flap 
endonuclease) processes Okazaki fragments by cleaving the 5' flap formed when DNA polymerase 
invades a downstream fragment by strand displacement synthesis [179]. If Fenl competes 
successfully with the polymerase for PCNA in this situation, it could swiftly snip 5* flap structures 
before strand displacement proceeds too far. The PCNA binding motif is also found on the C- 
terminus of p21 cn> ', a cell cycle regulator that binds human PCNA and inhibits DNA replication 
[102,115,180]. Thus, PCNA may also play a central role in ordered reactions that link DNA 
replication with cell cycle control mechanisms. 
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In a DNA repair process such as nucleotide excision repair (NER) and mismatch repair, several 
proteins localize to a lesion (or mismatch) in DNA, excise the DNA containing the lesion, and then 
fill in the gap by DNA synthesis. In £ coli, the process of NER involves interactions between UvrA, 
UvrB, UvrC proteins, and DNA for ordered assembly and activity of the Uvr(A)BC excinuclease on 
DNA [11] UvrA functions as a molecular matchmaker to load UvrB on DNA, and then dissociates to 
allows UvrC into the complex. After excision, UvrC must leave before polymerase I can access the 
V - terminus and re-synthesize DNA across the gap. Furthermore, nucleotide excision repair is also 
coupled to transcription through the transcription-repair coupling factor (TCRF). TCRF can 
recognize RNA polymerase stalled at a lesion in DNA, facilitate disassembly of the transcription 
complex, and recruit the Uvr(A)BC excinuclease in a highly concerted reaction. Even more proteins 
function together in eukaryotic excision repair systems, including the DNA replication proteins, 
RPA, RFC, and PCNA [11]. In DNA mismatch repair, as in NER, several different proteins are 
involved in recognizing and fixing mismatched basepairs and insertion/deletion loops in DNA 
[16,17]. The highly coordinated activity required of proteins in each DNA repair pathway, and 
between the processes of DNA repair and replication (and recombination), implies that competitive 
protein switching mechanisms analogous to those observed in holoenzyme assembly may be 
employed in several DNA metabolic reactions. 

NEW FRONTIERS 

New technologies for crystal structure determination and rapid kinetic analysis have revealed 
intricate details of the design and mechanism of action of DNA polymerases. In vivo these enzymes 
often function in the context of multi-component assemblies in which accessory proteins influence 
polymerase activity as, for example, in DNA polymerase holoenzymes at the replication fork. 
Purification of these proteins from different and complex organisms, including yeast and humans, as 
well as reconstitution of holoenzymes in vitro is proceeding rapidly at this time. These advances, 
accompanied by an understanding of holoenzyme biochemistry, are laying the foundation for detailed 
structural and kinetic studies which are anticipated to further clarify how these large and complex 
biological machines work on DNA. 

It should be noted that interest in archaeal DNA replication has intensified in the past few years, in 
part due to the complete sequencing of several archaeal genomes. The DNA replication machinery in 
these organisms is by and large analogous to that in eukaryotes, except that archaeal proteins appear 
less complex in structure and composition [23]. Consequently, these proteins present attractive model 
systems for studies of holoenzyme / replisome structure and function, such as outlined above. An 
added advantage provided by these systems is the opportunity to understand how enzymes 
responsible for DNA replication and repair have adapted to extreme conditions. 

The newly identified lesion bypass polymerases present yet another frontier in DNA polymerase 
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research. These enzymes function in DNA replication for error-free or error-prone bypass of DNA 
segments damaged by ultraviolet radiation or chemicals [910,28]. In S. cerevisiae, for example, when 
the DNA replicase stalls at a cis-syn thymine-thymine dimer (which distorts the DNA template), Pol 
x\ can access the primer-terminus and insert two adenines opposite the T-T dimer [181-183]. The 
enzyme has low processivity and dissociates rapidly from DNA, allowing the replicase to rebind 
DNA after lesion bypass and restart DNA replication. Although Pol r| appears to recognize an A : T 
base pair, it functions with low fidelity compared to replicative DNA polymerases, implying it may 
be less sensitive to distorted DNA in the active site. Other lesion bypass polymerases are more 
promiscuous (e.g., S. cerevisiae Revl/Pol Q, and will insert any base opposite the T-T dimer, leading 
to error-prone lesion bypass [184]. Thus, even among this new and relatively small group of enzymes 
there are intriguing signs of variations on the polymerase theme. It will be interesting to determine 
how lesion bypass polymerases are designed for polymerization activity that contrasts dramatically 
with that of replicative DNA polymerases. 

In summary, on the simple foundation of a two-metal ion mechanism for catalyzing phosphodiester 
bond formation, a vast array of DNA polymerases has evolved, facilitating not only the development 
of complex organisms, but also providing those complex organisms with the opportunity for 
continually exciting research in this field. 
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Function 


Bacteria 
(E. coli) 


Bacteriophage 
(T4) 


Eukarya 
(humans / S. cerevisiae) 


i\rcnaea 


Helicase 


DnaB 
(homohexamer) 


gp41 
(homohexamer) 


MCM 
(6 subunits) 


MCM homolog 
(1 to 4 subunits) 


Primase 


DnaG 


gp6l 


Pol cc/primase (4 
subunits) 


Primase homolog 


Single-strand 


SSB 


gp32 


RPA (3 subunits) 


RPA homolog 


Polymerase 
(3 '-5' exonuclease) 


core polymerase 
(3 subunits: a,e,0) 


gp43 


Pol 8 (3-4 subunits) 
Pole? (_ 5 subunits) 


Family B polymerases 
DP2-like polymerases? 


Sliding clamp 


P (dimer) 


gp45 (trimer) 


PCNA (trimer) 


PCNA homolog (trimer) 


Clamp loader 


Y complex 


gp44/62 
(4 gp44 + 1 gp62) 


RFC 
(5 subunits) 


RFC homolog 
(two subunits) 
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E. coli Pol I Klenow Fragment 
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T7 DN A polymerase mechanism 
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A. T7 DNA polymerase active site 




B. Two metal ion mechanism 
for nucleotidyl transfer 




T7 DNA polymerase with thioredoxin 
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Thioredoxin 
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B. Polymerase versus editing mode 
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A. Sliding clamps 



T4gp45 RB69 clamp 




http://www.bentham.org/coc-sample/o-donnell/fig8a.jpg 




B. RB69 polymerase with clamp on DNA 




C Strand traversal D. Strand displacement 
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A. Clamp loading mechanism 

j-^a^ B. Model structure of y 
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